2025-05-07T19:42:32.1369681Z Current runner version: '2.323.0'
2025-05-07T19:42:32.1375326Z Runner name: 'i-0fc7bbcb5d3569138'
2025-05-07T19:42:32.1376184Z Machine name: 'ip-10-0-45-138'
2025-05-07T19:42:32.1378718Z ##[group]GITHUB_TOKEN Permissions
2025-05-07T19:42:32.1380748Z Contents: read
2025-05-07T19:42:32.1381336Z Metadata: read
2025-05-07T19:42:32.1381865Z Packages: read
2025-05-07T19:42:32.1382344Z ##[endgroup]
2025-05-07T19:42:32.1384678Z Secret source: None
2025-05-07T19:42:32.1385642Z Prepare workflow directory
2025-05-07T19:42:32.1986373Z Prepare all required actions
2025-05-07T19:42:32.2023975Z Getting action download info
2025-05-07T19:42:32.3726636Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683)
2025-05-07T19:42:32.6376817Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02)
2025-05-07T19:42:33.1547732Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.11, 11.8.0, clang)
2025-05-07T19:42:33.2340745Z A job started hook has been configured by the self-hosted runner administrator
2025-05-07T19:42:33.2452522Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh'
2025-05-07T19:42:33.2462304Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0}
2025-05-07T19:42:33.2463176Z ##[endgroup]
2025-05-07T19:42:34.5080473Z Runner Type: linux.24xlarge
2025-05-07T19:42:34.5080936Z Instance Type: c5.24xlarge
2025-05-07T19:42:34.5081254Z AMI Name: unknown
2025-05-07T19:42:34.5111599Z AMI ID: ami-071226ecf16aa7d96
2025-05-07T19:42:39.5623106Z ##[group]Checking docker version
2025-05-07T19:42:39.5635964Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}'
2025-05-07T19:42:39.5842559Z '1.44'
2025-05-07T19:42:39.5860818Z Docker daemon API version: '1.44'
2025-05-07T19:42:39.5861365Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}'
2025-05-07T19:42:39.6045861Z '1.44'
2025-05-07T19:42:39.6064716Z Docker client API version: '1.44'
2025-05-07T19:42:39.6069569Z ##[endgroup]
2025-05-07T19:42:39.6072093Z ##[group]Clean up resources from previous jobs
2025-05-07T19:42:39.6076653Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=761220"
2025-05-07T19:42:39.6227412Z ##[command]/usr/bin/docker network prune --force --filter "label=761220"
2025-05-07T19:42:39.6367482Z ##[endgroup]
2025-05-07T19:42:39.6367823Z ##[group]Create local container network
2025-05-07T19:42:39.6376871Z ##[command]/usr/bin/docker network create --label 761220 github_network_5669e03931344ab5bb72aa11fef66996
2025-05-07T19:42:39.8709771Z bb024eeae256576539fe9d3a87d67729691f3a54b3adfc7996fc4c628f3b364a
2025-05-07T19:42:39.8724734Z ##[endgroup]
2025-05-07T19:42:39.8747707Z ##[group]Starting job container
2025-05-07T19:42:39.8767356Z ##[command]/usr/bin/docker pull amazonlinux:2023
2025-05-07T19:42:40.0714055Z 2023: Pulling from library/amazonlinux
2025-05-07T19:42:40.1306861Z 1c3112c87ab2: Pulling fs layer
2025-05-07T19:42:40.6933943Z 1c3112c87ab2: Verifying Checksum
2025-05-07T19:42:40.6935848Z 1c3112c87ab2: Download complete
2025-05-07T19:42:42.1456238Z 1c3112c87ab2: Pull complete
2025-05-07T19:42:42.1602350Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988
2025-05-07T19:42:42.1652481Z Status: Downloaded newer image for amazonlinux:2023
2025-05-07T19:42:42.1681651Z docker.io/library/amazonlinux:2023
2025-05-07T19:42:42.1772999Z ##[command]/usr/bin/docker create --name 0f0548cb111f43f4969935384500e226_amazonlinux2023_99f7db --label 761220 --workdir /__w/FBGEMM/FBGEMM --network github_network_5669e03931344ab5bb72aa11fef66996 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null"
2025-05-07T19:42:42.5168227Z f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81
2025-05-07T19:42:42.5193194Z ##[command]/usr/bin/docker start f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81
2025-05-07T19:42:43.0551062Z f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81
2025-05-07T19:42:43.0569117Z ##[command]/usr/bin/docker ps --all --filter id=f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81 --filter status=running --no-trunc --format "{{.ID}} {{.Status}}"
2025-05-07T19:42:43.0710236Z f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81 Up Less than a second
2025-05-07T19:42:43.0726159Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81
2025-05-07T19:42:43.0877039Z HOME=/github/home
2025-05-07T19:42:43.0877644Z GITHUB_ACTIONS=true
2025-05-07T19:42:43.0877987Z CI=true
2025-05-07T19:42:43.0878384Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
2025-05-07T19:42:43.0896507Z ##[endgroup]
2025-05-07T19:42:43.0907754Z ##[group]Waiting for all services to be ready
2025-05-07T19:42:43.0910060Z ##[endgroup]
2025-05-07T19:42:43.0996055Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which
2025-05-07T19:42:43.0997084Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which
2025-05-07T19:42:43.0998132Z shell: bash --noprofile --norc -e -o pipefail {0}
2025-05-07T19:42:43.0998605Z env:
2025-05-07T19:42:43.0998938Z PRELUDE: .github/scripts/setup_env.bash
2025-05-07T19:42:43.0999409Z BUILD_ENV:
build_binary 2025-05-07T19:42:43.0999741Z BUILD_TARGET: default 2025-05-07T19:42:43.1000110Z BUILD_VARIANT: cuda 2025-05-07T19:42:43.1000464Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:43.1000850Z ##[endgroup] 2025-05-07T19:42:43.9666392Z Amazon Linux 2023 repository 63 MB/s | 37 MB 00:00 2025-05-07T19:42:50.5184051Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:43 2025. 2025-05-07T19:42:51.0740875Z Dependencies resolved. 2025-05-07T19:42:51.0915945Z Nothing to do. 2025-05-07T19:42:51.0917553Z Complete! 2025-05-07T19:42:51.3370512Z Last metadata expiration check: 0:00:08 ago on Wed May 7 19:42:43 2025. 2025-05-07T19:42:51.4000214Z Dependencies resolved. 2025-05-07T19:42:51.4228348Z ======================================================================================== 2025-05-07T19:42:51.4228945Z Package Arch Version Repository Size 2025-05-07T19:42:51.4229679Z ======================================================================================== 2025-05-07T19:42:51.4230133Z Installing: 2025-05-07T19:42:51.4230641Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:51.4231298Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:51.4231852Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:51.4232623Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:51.4233214Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:51.4233786Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:51.4234421Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:51.4234964Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:51.4235462Z Installing dependencies: 2025-05-07T19:42:51.4235950Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:51.4236597Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:51.4237524Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 
amazonlinux 41 k 2025-05-07T19:42:51.4238257Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:51.4238909Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:51.4239490Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:51.4240130Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:51.4240697Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:51.4241299Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:51.4241979Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:51.4242569Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:51.4243175Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:51.4243881Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:51.4244516Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:51.4245220Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:51.4245766Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:51.4246357Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:51.4247003Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:51.4347676Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:51.4348486Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:51.4349269Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:51.4349830Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:51.4350365Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:51.4350923Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:51.4351504Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:51.4352084Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:51.4352684Z openssh x86_64 
8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:51.4353244Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:51.4353822Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:51.4354335Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:51.4354925Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:51.4355462Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:51.4356024Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:51.4356639Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:51.4357275Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:51.4357879Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:51.4358670Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:51.4359259Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:51.4359822Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:51.4360378Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:51.4360942Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:51.4361515Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:51.4362083Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:51.4362663Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:51.4363299Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:51.4363946Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:51.4364522Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:51.4365336Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:51.4365921Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 
2025-05-07T19:42:51.4366524Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:51.4367107Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:51.4367674Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:51.4368243Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:51.4368780Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:51.4369338Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:51.4369917Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:51.4370500Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:51.4371060Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:51.4371603Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:51.4372197Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:51.4372789Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:51.4373390Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:51.4373970Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:51.4374579Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:51.4375207Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:51.4375773Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:51.4376410Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:51.4376933Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:51.4377501Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:51.4378070Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:51.4378610Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 
2025-05-07T19:42:51.4379260Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:51.4379860Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:51.4380452Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:51.4380997Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:51.4381503Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:51.4382045Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:51.4382562Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:51.4383098Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:51.4383610Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:51.4384124Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:51.4384633Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:51.4385159Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:51.4385822Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:51.4386582Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:51.4387162Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:51.4387748Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:51.4388303Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:51.4388860Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:51.4389410Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:51.4389954Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:51.4390499Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:51.4391042Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:51.4391498Z Installing 
weak dependencies: 2025-05-07T19:42:51.4391966Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:51.4392632Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:51.4393254Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:51.4393857Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:51.4394451Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:51.4395026Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:51.4395399Z 2025-05-07T19:42:51.4395501Z Transaction Summary 2025-05-07T19:42:51.4395795Z ======================================================================================== 2025-05-07T19:42:51.4396126Z Install 107 Packages 2025-05-07T19:42:51.4396276Z 2025-05-07T19:42:51.4396432Z Total download size: 38 M 2025-05-07T19:42:51.4396693Z Installed size: 151 M 2025-05-07T19:42:51.4396955Z Downloading Packages: 2025-05-07T19:42:51.5425676Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.8 MB/s | 82 kB 00:00 2025-05-07T19:42:51.5565184Z (2/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 22 MB/s | 786 kB 00:00 2025-05-07T19:42:51.5581409Z (3/107): elfutils-debuginfod-client-0.188-3.amz 2.7 MB/s | 41 kB 00:00 2025-05-07T19:42:51.5874331Z (4/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 79 MB/s | 5.3 MB 00:00 2025-05-07T19:42:51.5888702Z (5/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 1.9 MB/s | 54 kB 00:00 2025-05-07T19:42:51.5938704Z (6/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 
16 MB/s | 539 kB 00:00 2025-05-07T19:42:51.6097088Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 69 MB/s | 1.1 MB 00:00 2025-05-07T19:42:51.6274122Z (8/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 75 MB/s | 2.8 MB 00:00 2025-05-07T19:42:51.6360985Z (9/107): groff-base-1.22.4-7.amzn2023.0.2.x86_6 48 MB/s | 1.0 MB 00:00 2025-05-07T19:42:51.6576017Z (10/107): git-core-2.47.1-1.amzn2023.0.2.x86_64 70 MB/s | 4.7 MB 00:00 2025-05-07T19:42:51.6593292Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 5.4 MB/s | 160 kB 00:00 2025-05-07T19:42:51.6742497Z (12/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 44 MB/s | 1.6 MB 00:00 2025-05-07T19:42:51.6751267Z (13/107): jansson-2.14-0.amzn2023.x86_64.rpm 3.4 MB/s | 46 kB 00:00 2025-05-07T19:42:51.6782297Z (14/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 3.8 MB/s | 62 kB 00:00 2025-05-07T19:42:51.6817944Z (15/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 10 MB/s | 57 kB 00:00 2025-05-07T19:42:51.6892847Z (16/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 68 MB/s | 756 kB 00:00 2025-05-07T19:42:51.6921297Z (17/107): less-608-2.amzn2023.0.2.x86_64.rpm 10 MB/s | 168 kB 00:00 2025-05-07T19:42:51.6932172Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 
2.4 MB/s | 28 kB 00:00 2025-05-07T19:42:51.7015345Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 9.4 MB/s | 108 kB 00:00 2025-05-07T19:42:51.7036744Z (20/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 10 MB/s | 95 kB 00:00 2025-05-07T19:42:51.7073866Z (21/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 11 MB/s | 153 kB 00:00 2025-05-07T19:42:51.7113477Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 4.8 MB/s | 31 kB 00:00 2025-05-07T19:42:51.7144029Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 11 MB/s | 106 kB 00:00 2025-05-07T19:42:51.7168242Z (24/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 13 MB/s | 121 kB 00:00 2025-05-07T19:42:51.7187957Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 4.2 MB/s | 26 kB 00:00 2025-05-07T19:42:51.7280625Z (26/107): nano-8.3-1.amzn2023.x86_64.rpm 63 MB/s | 706 kB 00:00 2025-05-07T19:42:51.7289839Z (27/107): nano-default-editor-8.3-1.amzn2023.no 869 kB/s | 10 kB 00:00 2025-05-07T19:42:51.7321178Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 29 MB/s | 394 kB 00:00 2025-05-07T19:42:51.7421942Z (29/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 27 MB/s | 256 kB 00:00 2025-05-07T19:42:51.7472007Z (30/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 39 MB/s | 573 kB 00:00 2025-05-07T19:42:51.7502858Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 25 MB/s | 454 kB 00:00 2025-05-07T19:42:51.7558996Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 56 MB/s | 708 kB 00:00 2025-05-07T19:42:51.7610655Z (33/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 54 MB/s | 542 kB 00:00 2025-05-07T19:42:51.7623171Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 
8.2 MB/s | 93 kB 00:00 2025-05-07T19:42:51.7652379Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 5.2 MB/s | 41 kB 00:00 2025-05-07T19:42:51.7697510Z (36/107): perl-AutoLoader-5.74-477.amzn2023.0.6 3.0 MB/s | 22 kB 00:00 2025-05-07T19:42:51.7729738Z (37/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 18 MB/s | 179 kB 00:00 2025-05-07T19:42:51.7740251Z (38/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 3.3 MB/s | 29 kB 00:00 2025-05-07T19:42:51.7757267Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 4.1 MB/s | 22 kB 00:00 2025-05-07T19:42:51.7787404Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 9.8 MB/s | 55 kB 00:00 2025-05-07T19:42:51.7808843Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 4.0 MB/s | 26 kB 00:00 2025-05-07T19:42:51.7828127Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.5 MB/s | 36 kB 00:00 2025-05-07T19:42:51.7842137Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.7 MB/s | 26 kB 00:00 2025-05-07T19:42:51.7973179Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 103 MB/s | 1.7 MB 00:00 2025-05-07T19:42:51.7985590Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 973 kB/s | 15 kB 00:00 2025-05-07T19:42:51.7998585Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.7 MB/s | 41 kB 00:00 2025-05-07T19:42:51.8026602Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 6.4 MB/s | 31 kB 00:00 2025-05-07T19:42:51.8054195Z (48/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 4.6 MB/s | 21 kB 00:00 2025-05-07T19:42:51.8078257Z (49/107): perl-File-Find-1.37-477.amzn2023.0.6. 5.2 MB/s | 26 kB 00:00 2025-05-07T19:42:51.8096278Z (50/107): perl-File-Basename-2.85-477.amzn2023. 1.9 MB/s | 18 kB 00:00 2025-05-07T19:42:51.8114459Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 6.1 MB/s | 36 kB 00:00 2025-05-07T19:42:51.8153077Z (52/107): perl-File-stat-1.09-477.amzn2023.0.6. 
3.5 MB/s | 17 kB 00:00 2025-05-07T19:42:51.8179336Z (53/107): perl-File-Temp-0.231.100-2.amzn2023.0 6.4 MB/s | 60 kB 00:00 2025-05-07T19:42:51.8186788Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.2 MB/s | 16 kB 00:00 2025-05-07T19:42:51.8210704Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 12 MB/s | 60 kB 00:00 2025-05-07T19:42:51.8243003Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 3.1 MB/s | 16 kB 00:00 2025-05-07T19:42:51.8268972Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 5.6 MB/s | 42 kB 00:00 2025-05-07T19:42:51.8279037Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 7.9 MB/s | 56 kB 00:00 2025-05-07T19:42:51.8304568Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 17 MB/s | 87 kB 00:00 2025-05-07T19:42:51.8340082Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 8.1 MB/s | 42 kB 00:00 2025-05-07T19:42:51.8379355Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 24 MB/s | 218 kB 00:00 2025-05-07T19:42:51.8393066Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 2.6 MB/s | 23 kB 00:00 2025-05-07T19:42:51.8412084Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 5.1 MB/s | 31 kB 00:00 2025-05-07T19:42:51.8428219Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.8 MB/s | 13 kB 00:00 2025-05-07T19:42:51.8494788Z (65/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 47 MB/s | 392 kB 00:00 2025-05-07T19:42:51.8514999Z (66/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 1.9 MB/s | 23 kB 00:00 2025-05-07T19:42:51.8540770Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 8.7 MB/s | 97 kB 00:00 2025-05-07T19:42:51.8559984Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 14 MB/s | 85 kB 00:00 2025-05-07T19:42:51.8608681Z (69/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 14 MB/s | 84 kB 00:00 2025-05-07T19:42:51.8627922Z (70/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 
2.5 MB/s | 20 kB 00:00 2025-05-07T19:42:51.8660765Z (71/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 22 MB/s | 215 kB 00:00 2025-05-07T19:42:51.8682413Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 5.9 MB/s | 41 kB 00:00 2025-05-07T19:42:51.8725043Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 12 MB/s | 71 kB 00:00 2025-05-07T19:42:51.8745979Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 1.6 MB/s | 12 kB 00:00 2025-05-07T19:42:51.8762389Z (75/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 7.2 MB/s | 55 kB 00:00 2025-05-07T19:42:51.8790543Z (76/107): perl-Storable-3.21-458.amzn2023.0.2.x 15 MB/s | 96 kB 00:00 2025-05-07T19:42:51.8815357Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 3.3 MB/s | 15 kB 00:00 2025-05-07T19:42:51.8834445Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 7.8 MB/s | 48 kB 00:00 2025-05-07T19:42:51.8846006Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 4.2 MB/s | 22 kB 00:00 2025-05-07T19:42:51.8883450Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 5.6 MB/s | 36 kB 00:00 2025-05-07T19:42:51.8894665Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 2.7 MB/s | 17 kB 00:00 2025-05-07T19:42:51.8921115Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 3.2 MB/s | 22 kB 00:00 2025-05-07T19:42:51.8960710Z (83/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 18 MB/s | 108 kB 00:00 2025-05-07T19:42:51.8980242Z (84/107): perl-Time-Local-1.300-5.amzn2023.0.2. 4.3 MB/s | 34 kB 00:00 2025-05-07T19:42:51.8988952Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.5 MB/s | 17 kB 00:00 2025-05-07T19:42:51.9020513Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 4.6 MB/s | 23 kB 00:00 2025-05-07T19:42:51.9075061Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 1.8 MB/s | 14 kB 00:00 2025-05-07T19:42:51.9095700Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 
6.8 MB/s | 71 kB 00:00 2025-05-07T19:42:51.9121639Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 1.5 MB/s | 15 kB 00:00 2025-05-07T19:42:51.9322763Z (90/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 92 MB/s | 2.0 MB 00:00 2025-05-07T19:42:51.9341388Z (91/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.3 MB/s | 29 kB 00:00 2025-05-07T19:42:51.9407781Z (92/107): perl-overload-1.31-477.amzn2023.0.6.n 5.8 MB/s | 46 kB 00:00 2025-05-07T19:42:51.9413857Z (93/107): perl-overloading-0.02-477.amzn2023.0. 1.7 MB/s | 13 kB 00:00 2025-05-07T19:42:51.9465217Z (94/107): perl-parent-0.238-458.amzn2023.0.2.no 3.3 MB/s | 14 kB 00:00 2025-05-07T19:42:51.9487208Z (95/107): perl-podlators-4.14-458.amzn2023.0.2. 16 MB/s | 112 kB 00:00 2025-05-07T19:42:51.9523053Z (96/107): perl-subs-1.03-477.amzn2023.0.6.noarc 2.3 MB/s | 12 kB 00:00 2025-05-07T19:42:51.9553934Z (97/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 2.6 MB/s | 126 kB 00:00 2025-05-07T19:42:51.9562182Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 1.8 MB/s | 13 kB 00:00 2025-05-07T19:42:51.9693741Z (99/107): shadow-utils-4.9-12.amzn2023.0.4.x86_ 69 MB/s | 1.1 MB 00:00 2025-05-07T19:42:51.9716280Z (100/107): sudo-python-plugin-1.9.15-1.p5.amzn2 3.8 MB/s | 56 kB 00:00 2025-05-07T19:42:51.9800704Z (101/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 55 MB/s | 1.3 MB 00:00 2025-05-07T19:42:51.9856688Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 38 MB/s | 613 kB 00:00 2025-05-07T19:42:51.9916746Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 48 MB/s | 879 kB 00:00 2025-05-07T19:42:52.0080908Z (104/107): util-linux-2.37.4-1.amzn2023.0.4.x86 80 MB/s | 2.2 MB 00:00 2025-05-07T19:42:52.0146203Z (105/107): util-linux-core-2.37.4-1.amzn2023.0. 
16 MB/s | 432 kB 00:00 2025-05-07T19:42:52.0191689Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 31 MB/s | 779 kB 00:00 2025-05-07T19:42:52.0204371Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 4.0 MB/s | 42 kB 00:00 2025-05-07T19:42:52.0225174Z -------------------------------------------------------------------------------- 2025-05-07T19:42:52.0225667Z Total 63 MB/s | 38 MB 00:00 2025-05-07T19:42:53.0690037Z Running transaction check 2025-05-07T19:42:53.1153066Z Transaction check succeeded. 2025-05-07T19:42:53.1153413Z Running transaction test 2025-05-07T19:42:53.4843928Z Transaction test succeeded. 2025-05-07T19:42:53.4846046Z Running transaction 2025-05-07T19:42:54.1654929Z Preparing : 1/1 2025-05-07T19:42:54.1787277Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:54.2034616Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:54.2234484Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:54.2276190Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:54.2346666Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:54.2434036Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:54.2700586Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:54.2753179Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:54.2799899Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:54.3304373Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:54.3360066Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:54.3634978Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:54.3701202Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:54.3752187Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:54.3801096Z Installing : 
libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:54.3843874Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:54.3977161Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:54.4025200Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:54.4071208Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:54.4137401Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:54.4187603Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:54.4231136Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:54.4660694Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:54.4739943Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:54.4882314Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:54.5317193Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:54.5496070Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:54.6331846Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:54.6332580Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:54.6333086Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:54.6333347Z 2025-05-07T19:42:54.6537627Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:54.6866534Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:54.7068731Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:54.7140047Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:54.8254056Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:54.9770762Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:54.9907549Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 
2025-05-07T19:42:55.0316702Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:55.0398718Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:55.0474661Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:55.0554442Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:55.0641591Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:55.0695750Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:55.0745313Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:55.0797956Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:55.0891334Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:55.0961876Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:55.1063026Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:55.1276775Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:55.1373422Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:55.1424741Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:55.1474551Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:55.1531772Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:55.1591265Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:55.1656480Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:55.1742755Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:55.1812609Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:55.1857340Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:55.1916012Z Installing : 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:55.1979558Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:55.2038416Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:55.2083128Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:55.2141480Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:55.2211841Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:55.2266359Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:55.2375151Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:55.2461764Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:55.2516775Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:55.2570184Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:55.2618719Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:55.2695096Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:55.2794948Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:55.2873210Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:55.2930907Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:55.2984914Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:55.3061794Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:55.3118987Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:55.3179284Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:55.3250442Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:55.3298963Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 
2025-05-07T19:42:55.3347699Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:55.3410411Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:55.3494418Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:55.3574036Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:55.3637707Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:55.3703808Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:55.3754949Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:55.3802516Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:55.3868938Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:55.3916361Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:55.3972179Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:55.4031362Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:55.4089536Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:55.4173340Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:55.4711917Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:55.5680336Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:55.5813537Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:55.5892480Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:55.5966530Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:55.6031315Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:55.6096934Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:42:55.6151996Z Installing : 
perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:42:55.6213594Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:55.6284436Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:42:55.6491517Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:42:55.6621304Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:42:55.6705642Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:42:55.7108969Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:42:55.8339362Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:42:55.8432782Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:55.8542767Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:55.8843459Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:42:55.8942736Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:55.9187385Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:42:55.9402115Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:55.9487041Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:55.9606845Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:42:56.7270154Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:56.7270959Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:42:56.7271543Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:42:56.7272384Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:42:56.7273226Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 
4/107 2025-05-07T19:42:56.7273851Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:56.7274563Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:42:56.7275232Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:42:56.7275844Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:42:56.7276783Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:42:56.7277383Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:42:56.7277991Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:42:56.7278629Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:42:56.7279272Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:42:56.7279885Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:42:56.7280513Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:42:56.7281165Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:56.7281742Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:56.7282440Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:42:56.7283142Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:56.7283745Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:42:56.7284447Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:42:56.7285042Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:42:56.7285874Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:42:56.7286611Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:56.7287229Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:42:56.7287850Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:42:56.7288508Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:42:56.7289175Z Verifying : 
ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:42:56.7289756Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:42:56.7290446Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:42:56.7291157Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:42:56.7291765Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:42:56.7292448Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:42:56.7293030Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:56.7293857Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:42:56.7294563Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:56.7295173Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:42:56.7295815Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:42:56.7296485Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:42:56.7297180Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:42:56.7297811Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:56.7298387Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:42:56.7298933Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:42:56.7299492Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:42:56.7300018Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:42:56.7300679Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:56.7301270Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:56.7301813Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:42:56.7302387Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:42:56.7302948Z Verifying : 
perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:42:56.7303523Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:42:56.7304091Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:42:56.7304642Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:42:56.7305220Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:56.7305777Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:42:56.7306347Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:42:56.7306912Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:42:56.7307452Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:42:56.7308008Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:42:56.7308544Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:42:56.7309120Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:42:56.7309674Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:56.7310253Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:42:56.7310823Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:56.7311372Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:42:56.7311928Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:42:56.7312462Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:42:56.7313109Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:42:56.7313671Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:42:56.7314219Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:42:56.7314784Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 
2025-05-07T19:42:56.7315407Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:42:56.7315961Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:42:56.7316518Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:56.7317079Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:56.7317623Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:42:56.7318160Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:42:56.7318733Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:42:56.7319292Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:42:56.7319863Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:42:56.7320437Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:42:56.7321027Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:42:56.7321595Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:42:56.7322230Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:56.7322780Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:42:56.7323320Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:42:56.7323877Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:42:56.7324428Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:42:56.7324958Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:56.7325510Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:42:56.7326041Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:42:56.7326587Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:42:56.7327143Z Verifying : 
perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:42:56.7327702Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:42:56.7328267Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:56.7328797Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:42:56.7329359Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:42:56.7329891Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:42:56.7330435Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:42:56.7330971Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:42:56.7331506Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:42:56.7332078Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:42:56.7332591Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:42:56.7333117Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:56.7333651Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:42:56.7334191Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:56.8345281Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:56.8345657Z 2025-05-07T19:42:56.8345837Z Installed: 2025-05-07T19:42:56.8346202Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:42:56.8347093Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8347701Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:42:56.8348360Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8348930Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8349446Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8349940Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8350480Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.8351014Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 
2025-05-07T19:42:56.8351532Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8352059Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:42:56.8352656Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:42:56.8353309Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:42:56.8353915Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:42:56.8354419Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8354932Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8355443Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8355968Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:42:56.8356513Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8357079Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.8357662Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8358213Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8358767Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8359326Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8359866Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8360390Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:42:56.8360935Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:42:56.8361498Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8362035Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:42:56.8362557Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:42:56.8363101Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:56.8363670Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:56.8364204Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:42:56.8364825Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8365325Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8365867Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8366373Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8366882Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.8367500Z 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8368041Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8368592Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.8369136Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8369722Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8370267Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8370976Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8371536Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:42:56.8372084Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:42:56.8372659Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8373208Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8373852Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8374442Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.8374978Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.8375540Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8376077Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8376645Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.8377185Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8377740Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.8378294Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:42:56.8378813Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8379366Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:42:56.8379919Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.8380492Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8381035Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8381627Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:42:56.8382198Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8382728Z 
perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:42:56.8383290Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8383829Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8384407Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.8384991Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:42:56.8385542Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.8386524Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.8387108Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8387738Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8388321Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8389045Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8389662Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8390269Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:42:56.8390901Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.8391501Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8392150Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.8392876Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:42:56.8393484Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:42:56.8394064Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:42:56.8394610Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8395188Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:42:56.8395742Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8396468Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8397032Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8397574Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:42:56.8398128Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8398648Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:42:56.8399214Z 
perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8399790Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8400376Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.8400946Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:42:56.8401503Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8402064Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:42:56.8402602Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:42:56.8403134Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:56.8403685Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:56.8404243Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:42:56.8404763Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.8405370Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.8405910Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.8406416Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:42:56.8406906Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:42:56.8407207Z 2025-05-07T19:42:56.8407309Z Complete! 
2025-05-07T19:42:56.9190669Z ##[group]Run actions/checkout@v4 2025-05-07T19:42:56.9191032Z with: 2025-05-07T19:42:56.9191335Z submodules: true 2025-05-07T19:42:56.9191603Z repository: pytorch/FBGEMM 2025-05-07T19:42:56.9192102Z token: *** 2025-05-07T19:42:56.9192330Z ssh-strict: true 2025-05-07T19:42:56.9192774Z ssh-user: git 2025-05-07T19:42:56.9193030Z persist-credentials: true 2025-05-07T19:42:56.9193337Z clean: true 2025-05-07T19:42:56.9193676Z sparse-checkout-cone-mode: true 2025-05-07T19:42:56.9194215Z fetch-depth: 1 2025-05-07T19:42:56.9194482Z fetch-tags: false 2025-05-07T19:42:56.9194727Z show-progress: true 2025-05-07T19:42:56.9195008Z lfs: false 2025-05-07T19:42:56.9195243Z set-safe-directory: true 2025-05-07T19:42:56.9195541Z env: 2025-05-07T19:42:56.9195782Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:56.9196143Z BUILD_ENV: build_binary 2025-05-07T19:42:56.9196411Z BUILD_TARGET: default 2025-05-07T19:42:56.9196691Z BUILD_VARIANT: cuda 2025-05-07T19:42:56.9197028Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:56.9197298Z ##[endgroup] 2025-05-07T19:42:56.9242701Z ##[command]/usr/bin/docker exec f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:42:57.2436686Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:42:57.2438140Z ##[group]Getting Git version info 2025-05-07T19:42:57.2438493Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:42:57.2439037Z [command]/usr/bin/git version 2025-05-07T19:42:57.2439365Z git version 2.47.1 2025-05-07T19:42:57.2440333Z ##[endgroup] 2025-05-07T19:42:57.2444153Z Temporarily overriding HOME='/__w/_temp/c60ece57-b474-4353-9325-fa588fe330d4' before making global git config changes 2025-05-07T19:42:57.2444977Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:42:57.2445656Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:42:57.2478986Z [command]/usr/bin/git 
config --local --get remote.origin.url 2025-05-07T19:42:57.2492728Z https://github.com/pytorch/FBGEMM 2025-05-07T19:42:57.2507515Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:42:57.2510830Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:42:57.2531378Z HEAD 2025-05-07T19:42:57.2562174Z ##[endgroup] 2025-05-07T19:42:57.2562920Z [command]/usr/bin/git submodule status 2025-05-07T19:42:57.2935154Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:42:57.3007898Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (remotes/origin/FBGEMM) 2025-05-07T19:42:57.3108678Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:42:57.3170936Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (remotes/origin/FBGEMM) 2025-05-07T19:42:57.3387870Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (release-1.8.0-3335-gf8d7d77c) 2025-05-07T19:42:57.3466924Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (remotes/origin/mmelesse-9-g4200844) 2025-05-07T19:42:57.3514851Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (v3.11.2-84-g9cca280a) 2025-05-07T19:42:57.3533343Z ##[group]Cleaning the repository 2025-05-07T19:42:57.3533725Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:42:57.3584809Z Removing amdgpu-install_6.2.60204-1_all.deb 2025-05-07T19:42:57.3591583Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:42:57.4699550Z HEAD is now at a5ab0b0 Merge 3e0eb9844c62b4a9cef00aa8fd072a26f76b40ac into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:57.4702162Z ##[endgroup] 2025-05-07T19:42:57.4704429Z ##[group]Disabling automatic garbage collection 2025-05-07T19:42:57.4710090Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:42:57.4738817Z ##[endgroup] 2025-05-07T19:42:57.4739287Z ##[group]Setting up auth 2025-05-07T19:42:57.4742962Z [command]/usr/bin/git config --local 
--name-only --get-regexp core\.sshCommand 2025-05-07T19:42:57.4769681Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:42:57.5112041Z Entering 'external/asmjit' 2025-05-07T19:42:57.5164731Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.5233013Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.5283315Z Entering 'external/cutlass' 2025-05-07T19:42:57.5354497Z Entering 'external/googletest' 2025-05-07T19:42:57.5407804Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.5466228Z Entering 'external/json' 2025-05-07T19:42:57.5535434Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:42:57.5578199Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:42:57.5859592Z Entering 'external/asmjit' 2025-05-07T19:42:57.5908035Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.5965052Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.6016583Z Entering 'external/cutlass' 2025-05-07T19:42:57.6074814Z Entering 'external/googletest' 2025-05-07T19:42:57.6135959Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.6190646Z Entering 'external/json' 2025-05-07T19:42:57.6254129Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:57.6288613Z ##[endgroup] 2025-05-07T19:42:57.6289785Z ##[group]Fetching the repository 2025-05-07T19:42:57.6295952Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:42:57.8194777Z From https://github.com/pytorch/FBGEMM 
2025-05-07T19:42:57.8195492Z + a5ab0b0...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:42:57.8217324Z ##[endgroup] 2025-05-07T19:42:57.8217773Z ##[group]Determining the checkout info 2025-05-07T19:42:57.8218333Z ##[endgroup] 2025-05-07T19:42:57.8219153Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:42:57.8767941Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:42:57.8769002Z ##[group]Checking out the ref 2025-05-07T19:42:57.8769469Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:42:57.9755581Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:42:57.9756039Z any of your branches: 2025-05-07T19:42:57.9756279Z 2025-05-07T19:42:57.9756664Z a5ab0b0 Merge 3e0eb9844c62b4a9cef00aa8fd072a26f76b40ac into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:57.9757274Z 2025-05-07T19:42:57.9757491Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:42:57.9758012Z to do so with: 2025-05-07T19:42:57.9758165Z 2025-05-07T19:42:57.9758321Z git branch a5ab0b0 2025-05-07T19:42:57.9758694Z 2025-05-07T19:42:57.9759293Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:57.9760604Z ##[endgroup] 2025-05-07T19:42:57.9761042Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:42:57.9763916Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:57.9808096Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:42:57.9829924Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:42:57.9854544Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:42:57.9876355Z 
##[endgroup] 2025-05-07T19:42:57.9876782Z ##[group]Fetching submodules 2025-05-07T19:42:57.9877093Z [command]/usr/bin/git submodule sync 2025-05-07T19:42:58.0179265Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:42:58.0180716Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:42:58.0182061Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:42:58.0183218Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:42:58.0184438Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:42:58.0186117Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:42:58.0187005Z Synchronizing submodule url for 'external/json' 2025-05-07T19:42:58.0188007Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:42:58.0942853Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:42:58.3645512Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:42:58.4656639Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:42:59.1275027Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:42:59.1710264Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:42:59.1791109Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:42:59.2986367Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:42:59.2994982Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:42:59.3280233Z Entering 'external/asmjit' 2025-05-07T19:42:59.3318743Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.3352871Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.3379842Z Entering 
'external/cutlass' 2025-05-07T19:42:59.3416811Z Entering 'external/googletest' 2025-05-07T19:42:59.3444725Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.3474861Z Entering 'external/json' 2025-05-07T19:42:59.3517244Z ##[endgroup] 2025-05-07T19:42:59.3517704Z ##[group]Persisting credentials for submodules 2025-05-07T19:42:59.3518786Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:42:59.3789902Z Entering 'external/asmjit' 2025-05-07T19:42:59.3839672Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.3896685Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.3950455Z Entering 'external/cutlass' 2025-05-07T19:42:59.4005064Z Entering 'external/googletest' 2025-05-07T19:42:59.4054166Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.4115647Z Entering 'external/json' 2025-05-07T19:42:59.4187234Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:42:59.4453079Z Entering 'external/asmjit' 2025-05-07T19:42:59.4494649Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:42:59.4496130Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.4542817Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:42:59.4543551Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.4587277Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:42:59.4587812Z Entering 'external/cutlass' 2025-05-07T19:42:59.4634259Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:42:59.4635715Z Entering 'external/googletest' 2025-05-07T19:42:59.4676137Z 
file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:42:59.4676698Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.4740264Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:42:59.4741095Z Entering 'external/json' 2025-05-07T19:42:59.4789049Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:42:59.4854265Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:42:59.5117963Z Entering 'external/asmjit' 2025-05-07T19:42:59.5155212Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.5187696Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.5219006Z Entering 'external/cutlass' 2025-05-07T19:42:59.5250430Z Entering 'external/googletest' 2025-05-07T19:42:59.5284785Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.5316300Z Entering 'external/json' 2025-05-07T19:42:59.5358321Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:42:59.5621141Z Entering 'external/asmjit' 2025-05-07T19:42:59.5644782Z Entering 'external/composable_kernel' 2025-05-07T19:42:59.5669822Z Entering 'external/cpuinfo' 2025-05-07T19:42:59.5705802Z Entering 'external/cutlass' 2025-05-07T19:42:59.5734344Z Entering 'external/googletest' 2025-05-07T19:42:59.5756869Z Entering 'external/hipify_torch' 2025-05-07T19:42:59.5790975Z Entering 'external/json' 2025-05-07T19:42:59.5834024Z ##[endgroup] 2025-05-07T19:42:59.5859126Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:42:59.5877220Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:42:59.6027027Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:42:59.6027439Z . 
$PRELUDE; print_system_info 2025-05-07T19:42:59.6028015Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:59.6028380Z env: 2025-05-07T19:42:59.6028623Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:59.6028983Z BUILD_ENV: build_binary 2025-05-07T19:42:59.6029254Z BUILD_TARGET: default 2025-05-07T19:42:59.6029528Z BUILD_VARIANT: cuda 2025-05-07T19:42:59.6029775Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:59.6030059Z ##[endgroup] 2025-05-07T19:43:00.0451355Z ################################################################################ 2025-05-07T19:43:00.0452411Z # Print System Info 2025-05-07T19:43:00.0453062Z # 2025-05-07T19:43:00.0465670Z # [2025-05-07T19:43:00.045Z] + print_system_info 2025-05-07T19:43:00.0466936Z ################################################################################ 2025-05-07T19:43:00.0467324Z 2025-05-07T19:43:00.0467601Z ################################################################################ 2025-05-07T19:43:00.0468020Z [INFO] Printing environment variables ... 
2025-05-07T19:43:00.0468413Z + printenv 2025-05-07T19:43:00.0468549Z 2025-05-07T19:43:00.0473050Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:00.0473437Z BUILD_VARIANT=cuda 2025-05-07T19:43:00.0473711Z HOSTNAME=f3f10d3a0ffb 2025-05-07T19:43:00.0474209Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_e354fa53-f669-4669-81de-1c6aaf76eefb 2025-05-07T19:43:00.0474757Z GITHUB_ACTION=__run_2 2025-05-07T19:43:00.0475019Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:00.0475324Z RUNNER_NAME=i-0fc7bbcb5d3569138 2025-05-07T19:43:00.0475631Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:00.0475989Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:00.0476285Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:00.0476582Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:00.0476886Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:00.0477241Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:00.0477825Z *** 2025-05-07T19:43:00.0478095Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:00.0478438Z GITHUB_ACTIONS=true 2025-05-07T19:43:00.0478736Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:00.0479364Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:00.0479976Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:00.0480316Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:00.0480639Z RUNNER_OS=Linux 2025-05-07T19:43:00.0480899Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:00.0481197Z HOME=/github/home 2025-05-07T19:43:00.0481475Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:00.0481823Z RUNNER_ARCH=X64 2025-05-07T19:43:00.0482072Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:00.0482374Z BUILD_TARGET=default 2025-05-07T19:43:00.0482827Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_e354fa53-f669-4669-81de-1c6aaf76eefb 2025-05-07T19:43:00.0483541Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_e354fa53-f669-4669-81de-1c6aaf76eefb 2025-05-07T19:43:00.0484403Z 
GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:00.0484776Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:00.0485117Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:00.0485627Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_e354fa53-f669-4669-81de-1c6aaf76eefb 2025-05-07T19:43:00.0486404Z BUILD_ENV=build_binary 2025-05-07T19:43:00.0486676Z GITHUB_ACTOR=q10 2025-05-07T19:43:00.0486963Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:00.0487226Z KERN_NAME_LC=linux 2025-05-07T19:43:00.0487527Z BUILD_CUDA_VERSION=11.8.0 2025-05-07T19:43:00.0487871Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:00.0488286Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:00.0488631Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:00.0488945Z SHLVL=1 2025-05-07T19:43:00.0489199Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:00.0489473Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:00.0490046Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:00.0490468Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:00.0490782Z KERN_NAME=Linux 2025-05-07T19:43:00.0491030Z GITHUB_JOB=build_artifact 2025-05-07T19:43:00.0491364Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:00.0491677Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:00.0491988Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:00.0492316Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:00.0492699Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:00.0493145Z GITHUB_BASE_REF=main 2025-05-07T19:43:00.0493390Z CI=true 2025-05-07T19:43:00.0493659Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:00.0493970Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:00.0494303Z GITHUB_ACTION_REF= 2025-05-07T19:43:00.0494573Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:00.0495124Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_e354fa53-f669-4669-81de-1c6aaf76eefb 2025-05-07T19:43:00.0495642Z MACHINE_NAME=x86_64 2025-05-07T19:43:00.0495922Z 
_=/usr/bin/printenv 2025-05-07T19:43:00.0496078Z 2025-05-07T19:43:00.0496244Z ################################################################################ 2025-05-07T19:43:00.0496597Z [INFO] Print ldd version ... 2025-05-07T19:43:00.0496912Z + ldd --version 2025-05-07T19:43:00.0497057Z 2025-05-07T19:43:00.0497177Z ldd (GNU libc) 2.34 2025-05-07T19:43:00.0497507Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:00.0497998Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:00.0498623Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:00.0499154Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:00.0499401Z 2025-05-07T19:43:00.0499530Z ################################################################################ 2025-05-07T19:43:00.0499913Z [INFO] Print CPU info ... 2025-05-07T19:43:00.0500186Z + nproc 2025-05-07T19:43:00.0500335Z 2025-05-07T19:43:00.0503519Z 96 2025-05-07T19:43:00.0503645Z 2025-05-07T19:43:00.0503775Z + lscpu 2025-05-07T19:43:00.0503909Z 2025-05-07T19:43:00.0782227Z Architecture: x86_64 2025-05-07T19:43:00.0783439Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:00.0784664Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0786271Z Byte Order: Little Endian 2025-05-07T19:43:00.0787259Z CPU(s): 96 2025-05-07T19:43:00.0788195Z On-line CPU(s) list: 0-95 2025-05-07T19:43:00.0789089Z Vendor ID: GenuineIntel 2025-05-07T19:43:00.0789524Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0789969Z CPU family: 6 2025-05-07T19:43:00.0790280Z Model: 85 2025-05-07T19:43:00.0790623Z Thread(s) per core: 2 2025-05-07T19:43:00.0790958Z Core(s) per socket: 24 2025-05-07T19:43:00.0791301Z Socket(s): 2 2025-05-07T19:43:00.0791864Z Stepping: 7 2025-05-07T19:43:00.0792200Z BogoMIPS: 5999.98 2025-05-07T19:43:00.0794808Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush 
mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.0797338Z Hypervisor vendor: KVM 2025-05-07T19:43:00.0797794Z Virtualization type: full 2025-05-07T19:43:00.0798198Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:00.0798638Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:00.0799040Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:00.0799460Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:00.0799828Z NUMA node(s): 2 2025-05-07T19:43:00.0800196Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:00.0800569Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:00.0801111Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:00.0801758Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:00.0802291Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:00.0802962Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:00.0803576Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:00.0804260Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:00.0804918Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:00.0805367Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:00.0805803Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:00.0806225Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:00.0806864Z Vulnerability Spectre v1: 
Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:00.0807760Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:00.0808485Z Vulnerability Srbds: Not affected 2025-05-07T19:43:00.0808892Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:00.0809210Z 2025-05-07T19:43:00.0809322Z + cat /proc/cpuinfo 2025-05-07T19:43:00.0809489Z 2025-05-07T19:43:00.0809868Z processor : 0 2025-05-07T19:43:00.0810132Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0810442Z cpu family : 6 2025-05-07T19:43:00.0810685Z model : 85 2025-05-07T19:43:00.0811027Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0811409Z stepping : 7 2025-05-07T19:43:00.0811684Z microcode : 0x5003901 2025-05-07T19:43:00.0811942Z cpu MHz : 1201.724 2025-05-07T19:43:00.0812216Z cache size : 36608 KB 2025-05-07T19:43:00.0812507Z physical id : 0 2025-05-07T19:43:00.0812776Z siblings : 48 2025-05-07T19:43:00.0813036Z core id : 0 2025-05-07T19:43:00.0813265Z cpu cores : 24 2025-05-07T19:43:00.0813522Z apicid : 0 2025-05-07T19:43:00.0813753Z initial apicid : 0 2025-05-07T19:43:00.0814018Z fpu : yes 2025-05-07T19:43:00.0814256Z fpu_exception : yes 2025-05-07T19:43:00.0814601Z cpuid level : 13 2025-05-07T19:43:00.0814840Z wp : yes 2025-05-07T19:43:00.0817275Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0820144Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0820774Z bogomips : 5999.98 2025-05-07T19:43:00.0821053Z clflush size : 64 2025-05-07T19:43:00.0821332Z cache_alignment : 64 2025-05-07T19:43:00.0821645Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0822090Z power management: 2025-05-07T19:43:00.0822241Z 2025-05-07T19:43:00.0822357Z processor : 1 2025-05-07T19:43:00.0822629Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0822909Z cpu family : 6 2025-05-07T19:43:00.0823172Z model : 85 2025-05-07T19:43:00.0823506Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0823917Z stepping : 7 2025-05-07T19:43:00.0824150Z microcode : 0x5003901 2025-05-07T19:43:00.0824436Z cpu MHz : 1198.563 2025-05-07T19:43:00.0824710Z cache size : 36608 KB 2025-05-07T19:43:00.0824967Z physical id : 0 2025-05-07T19:43:00.0825249Z siblings : 48 2025-05-07T19:43:00.0825494Z core id : 1 2025-05-07T19:43:00.0825759Z cpu cores : 24 2025-05-07T19:43:00.0825997Z apicid : 2 2025-05-07T19:43:00.0826259Z initial apicid : 2 2025-05-07T19:43:00.0826513Z fpu : yes 2025-05-07T19:43:00.0826776Z fpu_exception : yes 2025-05-07T19:43:00.0827028Z cpuid level : 13 2025-05-07T19:43:00.0827298Z wp : yes 2025-05-07T19:43:00.0829723Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0832554Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0833215Z bogomips : 5999.98 2025-05-07T19:43:00.0833487Z clflush size : 64 2025-05-07T19:43:00.0833789Z cache_alignment : 64 2025-05-07T19:43:00.0834116Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0834470Z power management: 2025-05-07T19:43:00.0834621Z 2025-05-07T19:43:00.0834746Z processor : 2 2025-05-07T19:43:00.0834983Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0835273Z cpu family : 6 2025-05-07T19:43:00.0835497Z model : 85 2025-05-07T19:43:00.0835814Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0836185Z stepping : 7 2025-05-07T19:43:00.0836445Z microcode : 0x5003901 2025-05-07T19:43:00.0836716Z cpu MHz : 1200.087 2025-05-07T19:43:00.0836954Z cache size : 36608 KB 2025-05-07T19:43:00.0837229Z physical id : 0 2025-05-07T19:43:00.0837462Z siblings : 48 2025-05-07T19:43:00.0837717Z core id : 2 2025-05-07T19:43:00.0837941Z cpu cores : 24 2025-05-07T19:43:00.0838193Z apicid : 4 2025-05-07T19:43:00.0838416Z initial apicid : 4 2025-05-07T19:43:00.0838676Z fpu : yes 2025-05-07T19:43:00.0838899Z fpu_exception : yes 2025-05-07T19:43:00.0839165Z cpuid level : 13 2025-05-07T19:43:00.0839396Z wp : yes 2025-05-07T19:43:00.0841796Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0844660Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0845409Z bogomips : 5999.98 2025-05-07T19:43:00.0845653Z clflush size : 64 2025-05-07T19:43:00.0845927Z cache_alignment : 64 2025-05-07T19:43:00.0846228Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0846605Z power management: 2025-05-07T19:43:00.0846752Z 2025-05-07T19:43:00.0846850Z processor : 3 2025-05-07T19:43:00.0847182Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0847451Z cpu family : 6 2025-05-07T19:43:00.0847705Z model : 85 2025-05-07T19:43:00.0848002Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0848405Z stepping : 7 2025-05-07T19:43:00.0848633Z microcode : 0x5003901 2025-05-07T19:43:00.0848907Z cpu MHz : 1208.659 2025-05-07T19:43:00.0849164Z cache size : 36608 KB 2025-05-07T19:43:00.0849406Z physical id : 0 2025-05-07T19:43:00.0849657Z siblings : 48 2025-05-07T19:43:00.0849873Z core id : 3 2025-05-07T19:43:00.0850117Z cpu cores : 24 2025-05-07T19:43:00.0850342Z apicid : 6 2025-05-07T19:43:00.0850585Z initial apicid : 6 2025-05-07T19:43:00.0850817Z fpu : yes 2025-05-07T19:43:00.0851059Z fpu_exception : yes 2025-05-07T19:43:00.0851293Z cpuid level : 13 2025-05-07T19:43:00.0851539Z wp : yes 2025-05-07T19:43:00.0853871Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0856552Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0857174Z bogomips : 5999.98 2025-05-07T19:43:00.0857429Z clflush size : 64 2025-05-07T19:43:00.0857661Z cache_alignment : 64 2025-05-07T19:43:00.0857980Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0858321Z power management: 2025-05-07T19:43:00.0858463Z 2025-05-07T19:43:00.0858583Z processor : 4 2025-05-07T19:43:00.0858812Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0859093Z cpu family : 6 2025-05-07T19:43:00.0859313Z model : 85 2025-05-07T19:43:00.0859637Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0860004Z stepping : 7 2025-05-07T19:43:00.0860255Z microcode : 0x5003901 2025-05-07T19:43:00.0860523Z cpu MHz : 1200.822 2025-05-07T19:43:00.0860763Z cache size : 36608 KB 2025-05-07T19:43:00.0861041Z physical id : 0 2025-05-07T19:43:00.0861271Z siblings : 48 2025-05-07T19:43:00.0861527Z core id : 4 2025-05-07T19:43:00.0861748Z cpu cores : 24 2025-05-07T19:43:00.0861989Z apicid : 8 2025-05-07T19:43:00.0862207Z initial apicid : 8 2025-05-07T19:43:00.0862465Z fpu : yes 2025-05-07T19:43:00.0862686Z fpu_exception : yes 2025-05-07T19:43:00.0862950Z cpuid level : 13 2025-05-07T19:43:00.0863175Z wp : yes 2025-05-07T19:43:00.0865511Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0868281Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0868924Z bogomips : 5999.98 2025-05-07T19:43:00.0869167Z clflush size : 64 2025-05-07T19:43:00.0869441Z cache_alignment : 64 2025-05-07T19:43:00.0869736Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0870116Z power management: 2025-05-07T19:43:00.0870266Z 2025-05-07T19:43:00.0870359Z processor : 5 2025-05-07T19:43:00.0870626Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0870889Z cpu family : 6 2025-05-07T19:43:00.0875685Z model : 85 2025-05-07T19:43:00.0876130Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0876565Z stepping : 7 2025-05-07T19:43:00.0876841Z microcode : 0x5003901 2025-05-07T19:43:00.0877102Z cpu MHz : 1198.838 2025-05-07T19:43:00.0877380Z cache size : 36608 KB 2025-05-07T19:43:00.0877637Z physical id : 0 2025-05-07T19:43:00.0877908Z siblings : 48 2025-05-07T19:43:00.0878144Z core id : 5 2025-05-07T19:43:00.0878406Z cpu cores : 24 2025-05-07T19:43:00.0878641Z apicid : 10 2025-05-07T19:43:00.0878907Z initial apicid : 10 2025-05-07T19:43:00.0879155Z fpu : yes 2025-05-07T19:43:00.0879421Z fpu_exception : yes 2025-05-07T19:43:00.0879670Z cpuid level : 13 2025-05-07T19:43:00.0879927Z wp : yes 2025-05-07T19:43:00.0882333Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0885079Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0885907Z bogomips : 5999.98 2025-05-07T19:43:00.0886180Z clflush size : 64 2025-05-07T19:43:00.0886421Z cache_alignment : 64 2025-05-07T19:43:00.0886787Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0887139Z power management: 2025-05-07T19:43:00.0887316Z 2025-05-07T19:43:00.0887413Z processor : 6 2025-05-07T19:43:00.0887652Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0887941Z cpu family : 6 2025-05-07T19:43:00.0888166Z model : 85 2025-05-07T19:43:00.0888493Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0888878Z stepping : 7 2025-05-07T19:43:00.0889138Z microcode : 0x5003901 2025-05-07T19:43:00.0889419Z cpu MHz : 1199.707 2025-05-07T19:43:00.0889664Z cache size : 36608 KB 2025-05-07T19:43:00.0889942Z physical id : 0 2025-05-07T19:43:00.0890173Z siblings : 48 2025-05-07T19:43:00.0890421Z core id : 6 2025-05-07T19:43:00.0890639Z cpu cores : 24 2025-05-07T19:43:00.0890892Z apicid : 12 2025-05-07T19:43:00.0891122Z initial apicid : 12 2025-05-07T19:43:00.0891387Z fpu : yes 2025-05-07T19:43:00.0891606Z fpu_exception : yes 2025-05-07T19:43:00.0891869Z cpuid level : 13 2025-05-07T19:43:00.0892103Z wp : yes 2025-05-07T19:43:00.0894507Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0897395Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0898161Z bogomips : 5999.98 2025-05-07T19:43:00.0898410Z clflush size : 64 2025-05-07T19:43:00.0898794Z cache_alignment : 64 2025-05-07T19:43:00.0899084Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0899438Z power management: 2025-05-07T19:43:00.0899574Z 2025-05-07T19:43:00.0899668Z processor : 7 2025-05-07T19:43:00.0899911Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0900156Z cpu family : 6 2025-05-07T19:43:00.0900397Z model : 85 2025-05-07T19:43:00.0900678Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0901126Z stepping : 7 2025-05-07T19:43:00.0901376Z microcode : 0x5003901 2025-05-07T19:43:00.0901616Z cpu MHz : 1200.151 2025-05-07T19:43:00.0901876Z cache size : 36608 KB 2025-05-07T19:43:00.0902118Z physical id : 0 2025-05-07T19:43:00.0902368Z siblings : 48 2025-05-07T19:43:00.0902580Z core id : 7 2025-05-07T19:43:00.0902827Z cpu cores : 24 2025-05-07T19:43:00.0903042Z apicid : 14 2025-05-07T19:43:00.0903283Z initial apicid : 14 2025-05-07T19:43:00.0903507Z fpu : yes 2025-05-07T19:43:00.0903738Z fpu_exception : yes 2025-05-07T19:43:00.0903963Z cpuid level : 13 2025-05-07T19:43:00.0904202Z wp : yes 2025-05-07T19:43:00.0906409Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0908953Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0909547Z bogomips : 5999.98 2025-05-07T19:43:00.0909796Z clflush size : 64 2025-05-07T19:43:00.0910025Z cache_alignment : 64 2025-05-07T19:43:00.0910332Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0910662Z power management: 2025-05-07T19:43:00.0910828Z 2025-05-07T19:43:00.0910920Z processor : 8 2025-05-07T19:43:00.0911147Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0911420Z cpu family : 6 2025-05-07T19:43:00.0911638Z model : 85 2025-05-07T19:43:00.0911945Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0912452Z stepping : 7 2025-05-07T19:43:00.0912802Z microcode : 0x5003901 2025-05-07T19:43:00.0913242Z cpu MHz : 1201.617 2025-05-07T19:43:00.0913488Z cache size : 36608 KB 2025-05-07T19:43:00.0913831Z physical id : 0 2025-05-07T19:43:00.0914058Z siblings : 48 2025-05-07T19:43:00.0914305Z core id : 8 2025-05-07T19:43:00.0914525Z cpu cores : 24 2025-05-07T19:43:00.0914775Z apicid : 16 2025-05-07T19:43:00.0915013Z initial apicid : 16 2025-05-07T19:43:00.0915275Z fpu : yes 2025-05-07T19:43:00.0915502Z fpu_exception : yes 2025-05-07T19:43:00.0915773Z cpuid level : 13 2025-05-07T19:43:00.0916002Z wp : yes 2025-05-07T19:43:00.0918399Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0921257Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0921914Z bogomips : 5999.98 2025-05-07T19:43:00.0922168Z clflush size : 64 2025-05-07T19:43:00.0922446Z cache_alignment : 64 2025-05-07T19:43:00.0922751Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0923136Z power management: 2025-05-07T19:43:00.0923284Z 2025-05-07T19:43:00.0923384Z processor : 9 2025-05-07T19:43:00.0923662Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0923941Z cpu family : 6 2025-05-07T19:43:00.0924194Z model : 85 2025-05-07T19:43:00.0924501Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0924917Z stepping : 7 2025-05-07T19:43:00.0925188Z microcode : 0x5003901 2025-05-07T19:43:00.0925495Z cpu MHz : 1200.097 2025-05-07T19:43:00.0925827Z cache size : 36608 KB 2025-05-07T19:43:00.0926093Z physical id : 0 2025-05-07T19:43:00.0926367Z siblings : 48 2025-05-07T19:43:00.0926600Z core id : 9 2025-05-07T19:43:00.0926858Z cpu cores : 24 2025-05-07T19:43:00.0927096Z apicid : 18 2025-05-07T19:43:00.0927356Z initial apicid : 18 2025-05-07T19:43:00.0927590Z fpu : yes 2025-05-07T19:43:00.0927835Z fpu_exception : yes 2025-05-07T19:43:00.0928075Z cpuid level : 13 2025-05-07T19:43:00.0928336Z wp : yes 2025-05-07T19:43:00.0930749Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0933484Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0934124Z bogomips : 5999.98 2025-05-07T19:43:00.0934387Z clflush size : 64 2025-05-07T19:43:00.0934630Z cache_alignment : 64 2025-05-07T19:43:00.0934953Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0935303Z power management: 2025-05-07T19:43:00.0935477Z 2025-05-07T19:43:00.0935575Z processor : 10 2025-05-07T19:43:00.0935822Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0936114Z cpu family : 6 2025-05-07T19:43:00.0936341Z model : 85 2025-05-07T19:43:00.0936731Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0937110Z stepping : 7 2025-05-07T19:43:00.0937365Z microcode : 0x5003901 2025-05-07T19:43:00.0937641Z cpu MHz : 2999.994 2025-05-07T19:43:00.0937886Z cache size : 36608 KB 2025-05-07T19:43:00.0938163Z physical id : 0 2025-05-07T19:43:00.0938402Z siblings : 48 2025-05-07T19:43:00.0938647Z core id : 10 2025-05-07T19:43:00.0938874Z cpu cores : 24 2025-05-07T19:43:00.0939125Z apicid : 20 2025-05-07T19:43:00.0939354Z initial apicid : 20 2025-05-07T19:43:00.0939619Z fpu : yes 2025-05-07T19:43:00.0939846Z fpu_exception : yes 2025-05-07T19:43:00.0940117Z cpuid level : 13 2025-05-07T19:43:00.0940346Z wp : yes 2025-05-07T19:43:00.0942734Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0945581Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0946237Z bogomips : 5999.98 2025-05-07T19:43:00.0946482Z clflush size : 64 2025-05-07T19:43:00.0946758Z cache_alignment : 64 2025-05-07T19:43:00.0947058Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0947447Z power management: 2025-05-07T19:43:00.0947597Z 2025-05-07T19:43:00.0947694Z processor : 11 2025-05-07T19:43:00.0947963Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0948224Z cpu family : 6 2025-05-07T19:43:00.0948478Z model : 85 2025-05-07T19:43:00.0948780Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0949182Z stepping : 7 2025-05-07T19:43:00.0949438Z microcode : 0x5003901 2025-05-07T19:43:00.0949799Z cpu MHz : 1198.607 2025-05-07T19:43:00.0950059Z cache size : 36608 KB 2025-05-07T19:43:00.0950357Z physical id : 0 2025-05-07T19:43:00.0950609Z siblings : 48 2025-05-07T19:43:00.0950997Z core id : 11 2025-05-07T19:43:00.0951248Z cpu cores : 24 2025-05-07T19:43:00.0951475Z apicid : 22 2025-05-07T19:43:00.0951728Z initial apicid : 22 2025-05-07T19:43:00.0951963Z fpu : yes 2025-05-07T19:43:00.0952212Z fpu_exception : yes 2025-05-07T19:43:00.0952454Z cpuid level : 13 2025-05-07T19:43:00.0952776Z wp : yes 2025-05-07T19:43:00.0955185Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0957932Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0958584Z bogomips : 5999.98 2025-05-07T19:43:00.0958849Z clflush size : 64 2025-05-07T19:43:00.0959094Z cache_alignment : 64 2025-05-07T19:43:00.0959430Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0959783Z power management: 2025-05-07T19:43:00.0959958Z 2025-05-07T19:43:00.0960057Z processor : 12 2025-05-07T19:43:00.0960303Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0960596Z cpu family : 6 2025-05-07T19:43:00.0960824Z model : 85 2025-05-07T19:43:00.0961143Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0961549Z stepping : 7 2025-05-07T19:43:00.0961784Z microcode : 0x5003901 2025-05-07T19:43:00.0962058Z cpu MHz : 1412.637 2025-05-07T19:43:00.0962299Z cache size : 36608 KB 2025-05-07T19:43:00.0962577Z physical id : 0 2025-05-07T19:43:00.0962812Z siblings : 48 2025-05-07T19:43:00.0963064Z core id : 12 2025-05-07T19:43:00.0963291Z cpu cores : 24 2025-05-07T19:43:00.0963545Z apicid : 24 2025-05-07T19:43:00.0963777Z initial apicid : 24 2025-05-07T19:43:00.0964048Z fpu : yes 2025-05-07T19:43:00.0964270Z fpu_exception : yes 2025-05-07T19:43:00.0964543Z cpuid level : 13 2025-05-07T19:43:00.0964786Z wp : yes 2025-05-07T19:43:00.0967633Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0970423Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0971106Z bogomips : 5999.98 2025-05-07T19:43:00.0971329Z clflush size : 64 2025-05-07T19:43:00.0971567Z cache_alignment : 64 2025-05-07T19:43:00.0971845Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0972191Z power management: 2025-05-07T19:43:00.0972327Z 2025-05-07T19:43:00.0972414Z processor : 13 2025-05-07T19:43:00.0972651Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0972893Z cpu family : 6 2025-05-07T19:43:00.0973107Z model : 85 2025-05-07T19:43:00.0973383Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0973753Z stepping : 7 2025-05-07T19:43:00.0973975Z microcode : 0x5003901 2025-05-07T19:43:00.0974213Z cpu MHz : 2999.994 2025-05-07T19:43:00.0974457Z cache size : 36608 KB 2025-05-07T19:43:00.0974681Z physical id : 0 2025-05-07T19:43:00.0974902Z siblings : 48 2025-05-07T19:43:00.0975166Z core id : 13 2025-05-07T19:43:00.0975380Z cpu cores : 24 2025-05-07T19:43:00.0975588Z apicid : 26 2025-05-07T19:43:00.0975804Z initial apicid : 26 2025-05-07T19:43:00.0976016Z fpu : yes 2025-05-07T19:43:00.0976230Z fpu_exception : yes 2025-05-07T19:43:00.0976446Z cpuid level : 13 2025-05-07T19:43:00.0976666Z wp : yes 2025-05-07T19:43:00.0979011Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0981723Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0982340Z bogomips : 5999.98 2025-05-07T19:43:00.0982575Z clflush size : 64 2025-05-07T19:43:00.0982796Z cache_alignment : 64 2025-05-07T19:43:00.0983087Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0983413Z power management: 2025-05-07T19:43:00.0983561Z 2025-05-07T19:43:00.0983647Z processor : 14 2025-05-07T19:43:00.0983863Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0984123Z cpu family : 6 2025-05-07T19:43:00.0984324Z model : 85 2025-05-07T19:43:00.0984616Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0984983Z stepping : 7 2025-05-07T19:43:00.0985194Z microcode : 0x5003901 2025-05-07T19:43:00.0985434Z cpu MHz : 2999.994 2025-05-07T19:43:00.0985652Z cache size : 36608 KB 2025-05-07T19:43:00.0986025Z physical id : 0 2025-05-07T19:43:00.0986252Z siblings : 48 2025-05-07T19:43:00.0986480Z core id : 14 2025-05-07T19:43:00.0986794Z cpu cores : 24 2025-05-07T19:43:00.0987060Z apicid : 28 2025-05-07T19:43:00.0987296Z initial apicid : 28 2025-05-07T19:43:00.0987564Z fpu : yes 2025-05-07T19:43:00.0987783Z fpu_exception : yes 2025-05-07T19:43:00.0988051Z cpuid level : 13 2025-05-07T19:43:00.0988277Z wp : yes 2025-05-07T19:43:00.0990663Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.0993523Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.0994172Z bogomips : 5999.98 2025-05-07T19:43:00.0994520Z clflush size : 64 2025-05-07T19:43:00.0994792Z cache_alignment : 64 2025-05-07T19:43:00.0995090Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.0995471Z power management: 2025-05-07T19:43:00.0995618Z 2025-05-07T19:43:00.0995720Z processor : 15 2025-05-07T19:43:00.0995992Z vendor_id : GenuineIntel 2025-05-07T19:43:00.0996255Z cpu family : 6 2025-05-07T19:43:00.0996505Z model : 85 2025-05-07T19:43:00.0996803Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.0997202Z stepping : 7 2025-05-07T19:43:00.0997459Z microcode : 0x5003901 2025-05-07T19:43:00.0997706Z cpu MHz : 2999.994 2025-05-07T19:43:00.0997972Z cache size : 36608 KB 2025-05-07T19:43:00.0998216Z physical id : 0 2025-05-07T19:43:00.0998468Z siblings : 48 2025-05-07T19:43:00.0998688Z core id : 15 2025-05-07T19:43:00.0998936Z cpu cores : 24 2025-05-07T19:43:00.0999161Z apicid : 30 2025-05-07T19:43:00.0999484Z initial apicid : 30 2025-05-07T19:43:00.0999723Z fpu : yes 2025-05-07T19:43:00.0999970Z fpu_exception : yes 2025-05-07T19:43:00.1000213Z cpuid level : 13 2025-05-07T19:43:00.1000473Z wp : yes 2025-05-07T19:43:00.1002863Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1005605Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1006262Z bogomips : 5999.98 2025-05-07T19:43:00.1006534Z clflush size : 64 2025-05-07T19:43:00.1006780Z cache_alignment : 64 2025-05-07T19:43:00.1007101Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1007451Z power management: 2025-05-07T19:43:00.1007620Z 2025-05-07T19:43:00.1007715Z processor : 16 2025-05-07T19:43:00.1007951Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1008243Z cpu family : 6 2025-05-07T19:43:00.1008514Z model : 85 2025-05-07T19:43:00.1008841Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1009257Z stepping : 7 2025-05-07T19:43:00.1009488Z microcode : 0x5003901 2025-05-07T19:43:00.1009773Z cpu MHz : 1199.749 2025-05-07T19:43:00.1010019Z cache size : 36608 KB 2025-05-07T19:43:00.1010300Z physical id : 0 2025-05-07T19:43:00.1010537Z siblings : 48 2025-05-07T19:43:00.1010791Z core id : 16 2025-05-07T19:43:00.1011015Z cpu cores : 24 2025-05-07T19:43:00.1011269Z apicid : 32 2025-05-07T19:43:00.1011499Z initial apicid : 32 2025-05-07T19:43:00.1011768Z fpu : yes 2025-05-07T19:43:00.1012000Z fpu_exception : yes 2025-05-07T19:43:00.1012288Z cpuid level : 13 2025-05-07T19:43:00.1012527Z wp : yes 2025-05-07T19:43:00.1014981Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1017918Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1018601Z bogomips : 5999.98 2025-05-07T19:43:00.1018854Z clflush size : 64 2025-05-07T19:43:00.1019127Z cache_alignment : 64 2025-05-07T19:43:00.1019505Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1019889Z power management: 2025-05-07T19:43:00.1020044Z 2025-05-07T19:43:00.1020139Z processor : 17 2025-05-07T19:43:00.1020438Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1020722Z cpu family : 6 2025-05-07T19:43:00.1020995Z model : 85 2025-05-07T19:43:00.1021325Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1021712Z stepping : 7 2025-05-07T19:43:00.1022167Z microcode : 0x5003901 2025-05-07T19:43:00.1022592Z cpu MHz : 1200.725 2025-05-07T19:43:00.1022841Z cache size : 36608 KB 2025-05-07T19:43:00.1023120Z physical id : 0 2025-05-07T19:43:00.1023350Z siblings : 48 2025-05-07T19:43:00.1023595Z core id : 17 2025-05-07T19:43:00.1023815Z cpu cores : 24 2025-05-07T19:43:00.1024063Z apicid : 34 2025-05-07T19:43:00.1024311Z initial apicid : 34 2025-05-07T19:43:00.1024543Z fpu : yes 2025-05-07T19:43:00.1024845Z fpu_exception : yes 2025-05-07T19:43:00.1025131Z cpuid level : 13 2025-05-07T19:43:00.1025403Z wp : yes 2025-05-07T19:43:00.1027809Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1030579Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1031219Z bogomips : 5999.98 2025-05-07T19:43:00.1031455Z clflush size : 64 2025-05-07T19:43:00.1031723Z cache_alignment : 64 2025-05-07T19:43:00.1032042Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1032390Z power management: 2025-05-07T19:43:00.1032607Z 2025-05-07T19:43:00.1032737Z processor : 18 2025-05-07T19:43:00.1032973Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1033263Z cpu family : 6 2025-05-07T19:43:00.1033486Z model : 85 2025-05-07T19:43:00.1033868Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1034238Z stepping : 7 2025-05-07T19:43:00.1034493Z microcode : 0x5003901 2025-05-07T19:43:00.1034739Z cpu MHz : 2999.994 2025-05-07T19:43:00.1035001Z cache size : 36608 KB 2025-05-07T19:43:00.1035249Z physical id : 0 2025-05-07T19:43:00.1035501Z siblings : 48 2025-05-07T19:43:00.1035750Z core id : 18 2025-05-07T19:43:00.1035974Z cpu cores : 24 2025-05-07T19:43:00.1036225Z apicid : 36 2025-05-07T19:43:00.1036458Z initial apicid : 36 2025-05-07T19:43:00.1036719Z fpu : yes 2025-05-07T19:43:00.1036944Z fpu_exception : yes 2025-05-07T19:43:00.1037215Z cpuid level : 13 2025-05-07T19:43:00.1037446Z wp : yes 2025-05-07T19:43:00.1039832Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1042566Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1043190Z bogomips : 5999.98 2025-05-07T19:43:00.1043452Z clflush size : 64 2025-05-07T19:43:00.1043695Z cache_alignment : 64 2025-05-07T19:43:00.1044031Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1044499Z power management: 2025-05-07T19:43:00.1044648Z 2025-05-07T19:43:00.1044747Z processor : 19 2025-05-07T19:43:00.1045130Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1045380Z cpu family : 6 2025-05-07T19:43:00.1045624Z model : 85 2025-05-07T19:43:00.1045907Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1046295Z stepping : 7 2025-05-07T19:43:00.1046514Z microcode : 0x5003901 2025-05-07T19:43:00.1046774Z cpu MHz : 1199.821 2025-05-07T19:43:00.1046999Z cache size : 36608 KB 2025-05-07T19:43:00.1047253Z physical id : 0 2025-05-07T19:43:00.1047494Z siblings : 48 2025-05-07T19:43:00.1047703Z core id : 19 2025-05-07T19:43:00.1047941Z cpu cores : 24 2025-05-07T19:43:00.1048154Z apicid : 38 2025-05-07T19:43:00.1048390Z initial apicid : 38 2025-05-07T19:43:00.1048613Z fpu : yes 2025-05-07T19:43:00.1048854Z fpu_exception : yes 2025-05-07T19:43:00.1049083Z cpuid level : 13 2025-05-07T19:43:00.1049332Z wp : yes 2025-05-07T19:43:00.1051569Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1076062Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1076855Z bogomips : 5999.98 2025-05-07T19:43:00.1077132Z clflush size : 64 2025-05-07T19:43:00.1077374Z cache_alignment : 64 2025-05-07T19:43:00.1077704Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1078071Z power management: 2025-05-07T19:43:00.1078231Z 2025-05-07T19:43:00.1078334Z processor : 20 2025-05-07T19:43:00.1078576Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1078859Z cpu family : 6 2025-05-07T19:43:00.1079079Z model : 85 2025-05-07T19:43:00.1079390Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1079794Z stepping : 7 2025-05-07T19:43:00.1080023Z microcode : 0x5003901 2025-05-07T19:43:00.1080297Z cpu MHz : 2999.994 2025-05-07T19:43:00.1080536Z cache size : 36608 KB 2025-05-07T19:43:00.1080802Z physical id : 0 2025-05-07T19:43:00.1081032Z siblings : 48 2025-05-07T19:43:00.1081269Z core id : 20 2025-05-07T19:43:00.1081482Z cpu cores : 24 2025-05-07T19:43:00.1081709Z apicid : 40 2025-05-07T19:43:00.1081928Z initial apicid : 40 2025-05-07T19:43:00.1082168Z fpu : yes 2025-05-07T19:43:00.1082381Z fpu_exception : yes 2025-05-07T19:43:00.1082628Z cpuid level : 13 2025-05-07T19:43:00.1082851Z wp : yes 2025-05-07T19:43:00.1085252Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1088174Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1088814Z bogomips : 5999.98 2025-05-07T19:43:00.1089046Z clflush size : 64 2025-05-07T19:43:00.1089355Z cache_alignment : 64 2025-05-07T19:43:00.1089641Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1089998Z power management: 2025-05-07T19:43:00.1090142Z 2025-05-07T19:43:00.1090233Z processor : 21 2025-05-07T19:43:00.1090638Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1090866Z cpu family : 6 2025-05-07T19:43:00.1091090Z model : 85 2025-05-07T19:43:00.1091375Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1091760Z stepping : 7 2025-05-07T19:43:00.1091983Z microcode : 0x5003901 2025-05-07T19:43:00.1092214Z cpu MHz : 1201.151 2025-05-07T19:43:00.1092440Z cache size : 36608 KB 2025-05-07T19:43:00.1092673Z physical id : 0 2025-05-07T19:43:00.1092907Z siblings : 48 2025-05-07T19:43:00.1093122Z core id : 21 2025-05-07T19:43:00.1093339Z cpu cores : 24 2025-05-07T19:43:00.1093554Z apicid : 42 2025-05-07T19:43:00.1093784Z initial apicid : 42 2025-05-07T19:43:00.1094013Z fpu : yes 2025-05-07T19:43:00.1094233Z fpu_exception : yes 2025-05-07T19:43:00.1094456Z cpuid level : 13 2025-05-07T19:43:00.1094676Z wp : yes 2025-05-07T19:43:00.1097110Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1099798Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1100374Z bogomips : 5999.98 2025-05-07T19:43:00.1100666Z clflush size : 64 2025-05-07T19:43:00.1100881Z cache_alignment : 64 2025-05-07T19:43:00.1101161Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1101474Z power management: 2025-05-07T19:43:00.1101615Z 2025-05-07T19:43:00.1101702Z processor : 22 2025-05-07T19:43:00.1101919Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1102173Z cpu family : 6 2025-05-07T19:43:00.1102377Z model : 85 2025-05-07T19:43:00.1102656Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1103000Z stepping : 7 2025-05-07T19:43:00.1103192Z microcode : 0x5003901 2025-05-07T19:43:00.1103424Z cpu MHz : 1204.279 2025-05-07T19:43:00.1103628Z cache size : 36608 KB 2025-05-07T19:43:00.1103849Z physical id : 0 2025-05-07T19:43:00.1104048Z siblings : 48 2025-05-07T19:43:00.1104252Z core id : 22 2025-05-07T19:43:00.1104449Z cpu cores : 24 2025-05-07T19:43:00.1104648Z apicid : 44 2025-05-07T19:43:00.1104851Z initial apicid : 44 2025-05-07T19:43:00.1105076Z fpu : yes 2025-05-07T19:43:00.1105277Z fpu_exception : yes 2025-05-07T19:43:00.1105500Z cpuid level : 13 2025-05-07T19:43:00.1105702Z wp : yes 2025-05-07T19:43:00.1107894Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1110437Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1111019Z bogomips : 5999.98 2025-05-07T19:43:00.1111225Z clflush size : 64 2025-05-07T19:43:00.1111444Z cache_alignment : 64 2025-05-07T19:43:00.1111705Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1112025Z power management: 2025-05-07T19:43:00.1112152Z 2025-05-07T19:43:00.1112230Z processor : 23 2025-05-07T19:43:00.1112461Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1112763Z cpu family : 6 2025-05-07T19:43:00.1113211Z model : 85 2025-05-07T19:43:00.1113501Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1113870Z stepping : 7 2025-05-07T19:43:00.1114110Z microcode : 0x5003901 2025-05-07T19:43:00.1114338Z cpu MHz : 1199.794 2025-05-07T19:43:00.1114574Z cache size : 36608 KB 2025-05-07T19:43:00.1114804Z physical id : 0 2025-05-07T19:43:00.1115035Z siblings : 48 2025-05-07T19:43:00.1115245Z core id : 23 2025-05-07T19:43:00.1115469Z cpu cores : 24 2025-05-07T19:43:00.1115674Z apicid : 46 2025-05-07T19:43:00.1115907Z initial apicid : 46 2025-05-07T19:43:00.1116125Z fpu : yes 2025-05-07T19:43:00.1116355Z fpu_exception : yes 2025-05-07T19:43:00.1116576Z cpuid level : 13 2025-05-07T19:43:00.1116803Z wp : yes 2025-05-07T19:43:00.1119241Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1121970Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1122591Z bogomips : 5999.98 2025-05-07T19:43:00.1122835Z clflush size : 64 2025-05-07T19:43:00.1123066Z cache_alignment : 64 2025-05-07T19:43:00.1123373Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1123712Z power management: 2025-05-07T19:43:00.1123873Z 2025-05-07T19:43:00.1123958Z processor : 24 2025-05-07T19:43:00.1124182Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1124450Z cpu family : 6 2025-05-07T19:43:00.1124647Z model : 85 2025-05-07T19:43:00.1124945Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1125419Z stepping : 7 2025-05-07T19:43:00.1125609Z microcode : 0x5003901 2025-05-07T19:43:00.1125833Z cpu MHz : 2999.994 2025-05-07T19:43:00.1126033Z cache size : 36608 KB 2025-05-07T19:43:00.1126268Z physical id : 1 2025-05-07T19:43:00.1126458Z siblings : 48 2025-05-07T19:43:00.1126676Z core id : 0 2025-05-07T19:43:00.1126863Z cpu cores : 24 2025-05-07T19:43:00.1127067Z apicid : 64 2025-05-07T19:43:00.1127264Z initial apicid : 64 2025-05-07T19:43:00.1127483Z fpu : yes 2025-05-07T19:43:00.1127665Z fpu_exception : yes 2025-05-07T19:43:00.1127884Z cpuid level : 13 2025-05-07T19:43:00.1128081Z wp : yes 2025-05-07T19:43:00.1130260Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1132780Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1133347Z bogomips : 5999.98 2025-05-07T19:43:00.1133552Z clflush size : 64 2025-05-07T19:43:00.1133770Z cache_alignment : 64 2025-05-07T19:43:00.1134026Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1134336Z power management: 2025-05-07T19:43:00.1134462Z 2025-05-07T19:43:00.1134549Z processor : 25 2025-05-07T19:43:00.1134767Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1134992Z cpu family : 6 2025-05-07T19:43:00.1135193Z model : 85 2025-05-07T19:43:00.1135452Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1135850Z stepping : 7 2025-05-07T19:43:00.1136057Z microcode : 0x5003901 2025-05-07T19:43:00.1136272Z cpu MHz : 2999.994 2025-05-07T19:43:00.1136506Z cache size : 36608 KB 2025-05-07T19:43:00.1136721Z physical id : 1 2025-05-07T19:43:00.1136947Z siblings : 48 2025-05-07T19:43:00.1137125Z core id : 1 2025-05-07T19:43:00.1137325Z cpu cores : 24 2025-05-07T19:43:00.1137514Z apicid : 66 2025-05-07T19:43:00.1137712Z initial apicid : 66 2025-05-07T19:43:00.1137905Z fpu : yes 2025-05-07T19:43:00.1138113Z fpu_exception : yes 2025-05-07T19:43:00.1138313Z cpuid level : 13 2025-05-07T19:43:00.1138520Z wp : yes 2025-05-07T19:43:00.1140746Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1143261Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1143823Z bogomips : 5999.98 2025-05-07T19:43:00.1144070Z clflush size : 64 2025-05-07T19:43:00.1144274Z cache_alignment : 64 2025-05-07T19:43:00.1144542Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1144841Z power management: 2025-05-07T19:43:00.1144984Z 2025-05-07T19:43:00.1145064Z processor : 26 2025-05-07T19:43:00.1145275Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1145516Z cpu family : 6 2025-05-07T19:43:00.1145709Z model : 85 2025-05-07T19:43:00.1145985Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1146331Z stepping : 7 2025-05-07T19:43:00.1146538Z microcode : 0x5003901 2025-05-07T19:43:00.1146764Z cpu MHz : 2999.994 2025-05-07T19:43:00.1146967Z cache size : 36608 KB 2025-05-07T19:43:00.1147197Z physical id : 1 2025-05-07T19:43:00.1147385Z siblings : 48 2025-05-07T19:43:00.1147581Z core id : 2 2025-05-07T19:43:00.1147768Z cpu cores : 24 2025-05-07T19:43:00.1147965Z apicid : 68 2025-05-07T19:43:00.1148154Z initial apicid : 68 2025-05-07T19:43:00.1148361Z fpu : yes 2025-05-07T19:43:00.1148553Z fpu_exception : yes 2025-05-07T19:43:00.1148784Z cpuid level : 13 2025-05-07T19:43:00.1149003Z wp : yes 2025-05-07T19:43:00.1151175Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1154021Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1154648Z bogomips : 5999.98 2025-05-07T19:43:00.1154883Z clflush size : 64 2025-05-07T19:43:00.1155107Z cache_alignment : 64 2025-05-07T19:43:00.1155389Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1155728Z power management: 2025-05-07T19:43:00.1155860Z 2025-05-07T19:43:00.1155952Z processor : 27 2025-05-07T19:43:00.1156185Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1156423Z cpu family : 6 2025-05-07T19:43:00.1156630Z model : 85 2025-05-07T19:43:00.1156908Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1157259Z stepping : 7 2025-05-07T19:43:00.1157485Z microcode : 0x5003901 2025-05-07T19:43:00.1157777Z cpu MHz : 2633.707 2025-05-07T19:43:00.1157995Z cache size : 36608 KB 2025-05-07T19:43:00.1158220Z physical id : 1 2025-05-07T19:43:00.1158441Z siblings : 48 2025-05-07T19:43:00.1158636Z core id : 3 2025-05-07T19:43:00.1159336Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:00.1159660Z cpu cores : 24 2025-05-07T19:43:00.1159869Z apicid : 70 2025-05-07T19:43:00.1160081Z initial apicid : 70 2025-05-07T19:43:00.1160299Z fpu : yes 2025-05-07T19:43:00.1160511Z fpu_exception : yes 2025-05-07T19:43:00.1160721Z cpuid level : 13 2025-05-07T19:43:00.1160947Z wp : yes 2025-05-07T19:43:00.1163350Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw 
avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.1166117Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1166687Z bogomips : 5999.98 2025-05-07T19:43:00.1166891Z clflush size : 64 2025-05-07T19:43:00.1167105Z cache_alignment : 64 2025-05-07T19:43:00.1167354Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1167677Z power management: 2025-05-07T19:43:00.1167804Z 2025-05-07T19:43:00.1167884Z processor : 28 2025-05-07T19:43:00.1168088Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1168328Z cpu family : 6 2025-05-07T19:43:00.1168512Z model : 85 2025-05-07T19:43:00.1168782Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1169111Z stepping : 7 2025-05-07T19:43:00.1169318Z microcode : 0x5003901 2025-05-07T19:43:00.1169529Z cpu MHz : 2999.994 2025-05-07T19:43:00.1169735Z cache size : 36608 KB 2025-05-07T19:43:00.1169942Z physical id : 1 2025-05-07T19:43:00.1170138Z siblings : 48 2025-05-07T19:43:00.1170322Z core id : 4 2025-05-07T19:43:00.1170512Z cpu cores : 24 2025-05-07T19:43:00.1170696Z apicid : 72 2025-05-07T19:43:00.1170893Z initial apicid : 72 2025-05-07T19:43:00.1171096Z fpu : yes 2025-05-07T19:43:00.1171277Z fpu_exception : yes 2025-05-07T19:43:00.1171496Z cpuid level : 13 2025-05-07T19:43:00.1171690Z wp : yes 2025-05-07T19:43:00.1173880Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl 
xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.1176413Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1176967Z bogomips : 5999.98 2025-05-07T19:43:00.1177181Z clflush size : 64 2025-05-07T19:43:00.1177381Z cache_alignment : 64 2025-05-07T19:43:00.1177640Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1177945Z power management: 2025-05-07T19:43:00.1178075Z 2025-05-07T19:43:00.1178153Z processor : 29 2025-05-07T19:43:00.1178369Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1178588Z cpu family : 6 2025-05-07T19:43:00.1178800Z model : 85 2025-05-07T19:43:00.1179054Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1179391Z stepping : 7 2025-05-07T19:43:00.1179581Z microcode : 0x5003901 2025-05-07T19:43:00.1179785Z cpu MHz : 2999.994 2025-05-07T19:43:00.1180031Z cache size : 36608 KB 2025-05-07T19:43:00.1180245Z physical id : 1 2025-05-07T19:43:00.1180433Z siblings : 48 2025-05-07T19:43:00.1180624Z core id : 5 2025-05-07T19:43:00.1180803Z cpu cores : 24 2025-05-07T19:43:00.1180998Z apicid : 74 2025-05-07T19:43:00.1181187Z initial apicid : 74 2025-05-07T19:43:00.1181386Z fpu : yes 2025-05-07T19:43:00.1181592Z fpu_exception : yes 2025-05-07T19:43:00.1181792Z cpuid level : 13 2025-05-07T19:43:00.1182006Z wp : yes 2025-05-07T19:43:00.1184244Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt 
xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.1187461Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1188075Z bogomips : 5999.98 2025-05-07T19:43:00.1188301Z clflush size : 64 2025-05-07T19:43:00.1188529Z cache_alignment : 64 2025-05-07T19:43:00.1188801Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1189132Z power management: 2025-05-07T19:43:00.1189269Z 2025-05-07T19:43:00.1189353Z processor : 30 2025-05-07T19:43:00.1189597Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1189872Z cpu family : 6 2025-05-07T19:43:00.1190106Z model : 85 2025-05-07T19:43:00.1190436Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1190820Z stepping : 7 2025-05-07T19:43:00.1191085Z microcode : 0x5003901 2025-05-07T19:43:00.1191345Z cpu MHz : 3718.168 2025-05-07T19:43:00.1191625Z cache size : 36608 KB 2025-05-07T19:43:00.1191878Z physical id : 1 2025-05-07T19:43:00.1192134Z siblings : 48 2025-05-07T19:43:00.1192364Z core id : 6 2025-05-07T19:43:00.1192697Z cpu cores : 24 2025-05-07T19:43:00.1192928Z apicid : 76 2025-05-07T19:43:00.1193192Z initial apicid : 76 2025-05-07T19:43:00.1193464Z fpu : yes 2025-05-07T19:43:00.1193689Z fpu_exception : yes 2025-05-07T19:43:00.1193969Z cpuid level : 13 2025-05-07T19:43:00.1194202Z wp : yes 2025-05-07T19:43:00.1196595Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec 
xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.1199345Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1199963Z bogomips : 5999.98 2025-05-07T19:43:00.1200221Z clflush size : 64 2025-05-07T19:43:00.1200459Z cache_alignment : 64 2025-05-07T19:43:00.1200780Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1201131Z power management: 2025-05-07T19:43:00.1201304Z 2025-05-07T19:43:00.1201398Z processor : 31 2025-05-07T19:43:00.1201664Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1201929Z cpu family : 6 2025-05-07T19:43:00.1202186Z model : 85 2025-05-07T19:43:00.1202487Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1202893Z stepping : 7 2025-05-07T19:43:00.1203128Z microcode : 0x5003901 2025-05-07T19:43:00.1203414Z cpu MHz : 1567.578 2025-05-07T19:43:00.1203664Z cache size : 36608 KB 2025-05-07T19:43:00.1203940Z physical id : 1 2025-05-07T19:43:00.1206167Z siblings : 48 2025-05-07T19:43:00.1206402Z core id : 7 2025-05-07T19:43:00.1206691Z cpu cores : 24 2025-05-07T19:43:00.1207100Z apicid : 78 2025-05-07T19:43:00.1207350Z initial apicid : 78 2025-05-07T19:43:00.1207586Z fpu : yes 2025-05-07T19:43:00.1207840Z fpu_exception : yes 2025-05-07T19:43:00.1208079Z cpuid level : 13 2025-05-07T19:43:00.1208329Z wp : yes 2025-05-07T19:43:00.1210704Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 
xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:00.1213418Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1214057Z bogomips : 5999.98 2025-05-07T19:43:00.1214296Z clflush size : 64 2025-05-07T19:43:00.1214563Z cache_alignment : 64 2025-05-07T19:43:00.1214859Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1215234Z power management: 2025-05-07T19:43:00.1215377Z 2025-05-07T19:43:00.1215495Z processor : 32 2025-05-07T19:43:00.1215731Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1216014Z cpu family : 6 2025-05-07T19:43:00.1216235Z model : 85 2025-05-07T19:43:00.1216550Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1216915Z stepping : 7 2025-05-07T19:43:00.1217161Z microcode : 0x5003901 2025-05-07T19:43:00.1217401Z cpu MHz : 2999.994 2025-05-07T19:43:00.1217660Z cache size : 36608 KB 2025-05-07T19:43:00.1217903Z physical id : 1 2025-05-07T19:43:00.1218151Z siblings : 48 2025-05-07T19:43:00.1218375Z core id : 8 2025-05-07T19:43:00.1218724Z cpu cores : 24 2025-05-07T19:43:00.1218933Z apicid : 80 2025-05-07T19:43:00.1219166Z initial apicid : 80 2025-05-07T19:43:00.1219418Z fpu : yes 2025-05-07T19:43:00.1219622Z fpu_exception : yes 2025-05-07T19:43:00.1219868Z cpuid level : 13 2025-05-07T19:43:00.1220081Z wp : yes 2025-05-07T19:43:00.1222286Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida 
arat pku ospke avx512_vnni 2025-05-07T19:43:00.1224847Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1225423Z bogomips : 5999.98 2025-05-07T19:43:00.1225672Z clflush size : 64 2025-05-07T19:43:00.1225898Z cache_alignment : 64 2025-05-07T19:43:00.1226197Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1226522Z power management: 2025-05-07T19:43:00.1226681Z 2025-05-07T19:43:00.1226770Z processor : 33 2025-05-07T19:43:00.1227019Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1227265Z cpu family : 6 2025-05-07T19:43:00.1227508Z model : 85 2025-05-07T19:43:00.1227797Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1228177Z stepping : 7 2025-05-07T19:43:00.1228391Z microcode : 0x5003901 2025-05-07T19:43:00.1228655Z cpu MHz : 2999.994 2025-05-07T19:43:00.1228880Z cache size : 36608 KB 2025-05-07T19:43:00.1229136Z physical id : 1 2025-05-07T19:43:00.1229448Z siblings : 48 2025-05-07T19:43:00.1229681Z core id : 9 2025-05-07T19:43:00.1229950Z cpu cores : 24 2025-05-07T19:43:00.1230195Z apicid : 82 2025-05-07T19:43:00.1230442Z initial apicid : 82 2025-05-07T19:43:00.1230652Z fpu : yes 2025-05-07T19:43:00.1230875Z fpu_exception : yes 2025-05-07T19:43:00.1231084Z cpuid level : 13 2025-05-07T19:43:00.1231308Z wp : yes 2025-05-07T19:43:00.1233757Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku 
ospke avx512_vnni 2025-05-07T19:43:00.1236558Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1237174Z bogomips : 5999.98 2025-05-07T19:43:00.1237400Z clflush size : 64 2025-05-07T19:43:00.1237634Z cache_alignment : 64 2025-05-07T19:43:00.1237909Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1238259Z power management: 2025-05-07T19:43:00.1238391Z 2025-05-07T19:43:00.1238496Z processor : 34 2025-05-07T19:43:00.1238723Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1238984Z cpu family : 6 2025-05-07T19:43:00.1239195Z model : 85 2025-05-07T19:43:00.1239492Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1239844Z stepping : 7 2025-05-07T19:43:00.1240124Z microcode : 0x5003901 2025-05-07T19:43:00.1240360Z cpu MHz : 2270.605 2025-05-07T19:43:00.1240586Z cache size : 36608 KB 2025-05-07T19:43:00.1240812Z physical id : 1 2025-05-07T19:43:00.1241040Z siblings : 48 2025-05-07T19:43:00.1241243Z core id : 10 2025-05-07T19:43:00.1241451Z cpu cores : 24 2025-05-07T19:43:00.1241666Z apicid : 84 2025-05-07T19:43:00.1241866Z initial apicid : 84 2025-05-07T19:43:00.1242098Z fpu : yes 2025-05-07T19:43:00.1242303Z fpu_exception : yes 2025-05-07T19:43:00.1242542Z cpuid level : 13 2025-05-07T19:43:00.1242750Z wp : yes 2025-05-07T19:43:00.1245225Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke 
avx512_vnni 2025-05-07T19:43:00.1247754Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1248308Z bogomips : 5999.98 2025-05-07T19:43:00.1248527Z clflush size : 64 2025-05-07T19:43:00.1248736Z cache_alignment : 64 2025-05-07T19:43:00.1249004Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1249313Z power management: 2025-05-07T19:43:00.1249444Z 2025-05-07T19:43:00.1249524Z processor : 35 2025-05-07T19:43:00.1249751Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1249981Z cpu family : 6 2025-05-07T19:43:00.1250182Z model : 85 2025-05-07T19:43:00.1250440Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1250779Z stepping : 7 2025-05-07T19:43:00.1250979Z microcode : 0x5003901 2025-05-07T19:43:00.1251210Z cpu MHz : 2999.994 2025-05-07T19:43:00.1251421Z cache size : 36608 KB 2025-05-07T19:43:00.1251640Z physical id : 1 2025-05-07T19:43:00.1251845Z siblings : 48 2025-05-07T19:43:00.1252052Z core id : 11 2025-05-07T19:43:00.1252239Z cpu cores : 24 2025-05-07T19:43:00.1252448Z apicid : 86 2025-05-07T19:43:00.1252665Z initial apicid : 86 2025-05-07T19:43:00.1252929Z fpu : yes 2025-05-07T19:43:00.1253145Z fpu_exception : yes 2025-05-07T19:43:00.1253360Z cpuid level : 13 2025-05-07T19:43:00.1253586Z wp : yes 2025-05-07T19:43:00.1255753Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1258329Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1258894Z bogomips : 5999.98 2025-05-07T19:43:00.1259096Z clflush size : 64 2025-05-07T19:43:00.1259307Z cache_alignment : 64 2025-05-07T19:43:00.1259556Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1259871Z power management: 2025-05-07T19:43:00.1259998Z 2025-05-07T19:43:00.1260095Z processor : 36 2025-05-07T19:43:00.1260302Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1260547Z cpu family : 6 2025-05-07T19:43:00.1260744Z model : 85 2025-05-07T19:43:00.1261015Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1261340Z stepping : 7 2025-05-07T19:43:00.1261548Z microcode : 0x5003901 2025-05-07T19:43:00.1261759Z cpu MHz : 2824.480 2025-05-07T19:43:00.1261976Z cache size : 36608 KB 2025-05-07T19:43:00.1262187Z physical id : 1 2025-05-07T19:43:00.1262395Z siblings : 48 2025-05-07T19:43:00.1262591Z core id : 12 2025-05-07T19:43:00.1262790Z cpu cores : 24 2025-05-07T19:43:00.1262993Z apicid : 88 2025-05-07T19:43:00.1263184Z initial apicid : 88 2025-05-07T19:43:00.1263402Z fpu : yes 2025-05-07T19:43:00.1263588Z fpu_exception : yes 2025-05-07T19:43:00.1263799Z cpuid level : 13 2025-05-07T19:43:00.1263990Z wp : yes 2025-05-07T19:43:00.1266150Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1268672Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1269232Z bogomips : 5999.98 2025-05-07T19:43:00.1269452Z clflush size : 64 2025-05-07T19:43:00.1269657Z cache_alignment : 64 2025-05-07T19:43:00.1269924Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1270238Z power management: 2025-05-07T19:43:00.1270383Z 2025-05-07T19:43:00.1270467Z processor : 37 2025-05-07T19:43:00.1270688Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1270909Z cpu family : 6 2025-05-07T19:43:00.1271115Z model : 85 2025-05-07T19:43:00.1271372Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1271715Z stepping : 7 2025-05-07T19:43:00.1271904Z microcode : 0x5003901 2025-05-07T19:43:00.1272134Z cpu MHz : 2142.113 2025-05-07T19:43:00.1272348Z cache size : 36608 KB 2025-05-07T19:43:00.1272661Z physical id : 1 2025-05-07T19:43:00.1272872Z siblings : 48 2025-05-07T19:43:00.1273266Z core id : 13 2025-05-07T19:43:00.1273470Z cpu cores : 24 2025-05-07T19:43:00.1273700Z apicid : 90 2025-05-07T19:43:00.1273927Z initial apicid : 90 2025-05-07T19:43:00.1274149Z fpu : yes 2025-05-07T19:43:00.1274380Z fpu_exception : yes 2025-05-07T19:43:00.1274668Z cpuid level : 13 2025-05-07T19:43:00.1274902Z wp : yes 2025-05-07T19:43:00.1277268Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1279982Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1280579Z bogomips : 5999.98 2025-05-07T19:43:00.1280834Z clflush size : 64 2025-05-07T19:43:00.1281051Z cache_alignment : 64 2025-05-07T19:43:00.1281317Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1281641Z power management: 2025-05-07T19:43:00.1281770Z 2025-05-07T19:43:00.1281860Z processor : 38 2025-05-07T19:43:00.1282068Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1282310Z cpu family : 6 2025-05-07T19:43:00.1282501Z model : 85 2025-05-07T19:43:00.1282779Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1283126Z stepping : 7 2025-05-07T19:43:00.1283332Z microcode : 0x5003901 2025-05-07T19:43:00.1283557Z cpu MHz : 2999.994 2025-05-07T19:43:00.1283777Z cache size : 36608 KB 2025-05-07T19:43:00.1283998Z physical id : 1 2025-05-07T19:43:00.1284205Z siblings : 48 2025-05-07T19:43:00.1284398Z core id : 14 2025-05-07T19:43:00.1284599Z cpu cores : 24 2025-05-07T19:43:00.1284802Z apicid : 92 2025-05-07T19:43:00.1284992Z initial apicid : 92 2025-05-07T19:43:00.1285198Z fpu : yes 2025-05-07T19:43:00.1285383Z fpu_exception : yes 2025-05-07T19:43:00.1285599Z cpuid level : 13 2025-05-07T19:43:00.1285966Z wp : yes 2025-05-07T19:43:00.1288312Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1291018Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1291605Z bogomips : 5999.98 2025-05-07T19:43:00.1291823Z clflush size : 64 2025-05-07T19:43:00.1292037Z cache_alignment : 64 2025-05-07T19:43:00.1292308Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1292629Z power management: 2025-05-07T19:43:00.1292768Z 2025-05-07T19:43:00.1292847Z processor : 39 2025-05-07T19:43:00.1293064Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1293291Z cpu family : 6 2025-05-07T19:43:00.1293496Z model : 85 2025-05-07T19:43:00.1293760Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1294114Z stepping : 7 2025-05-07T19:43:00.1294313Z microcode : 0x5003901 2025-05-07T19:43:00.1294538Z cpu MHz : 2999.994 2025-05-07T19:43:00.1294744Z cache size : 36608 KB 2025-05-07T19:43:00.1294966Z physical id : 1 2025-05-07T19:43:00.1295163Z siblings : 48 2025-05-07T19:43:00.1295363Z core id : 15 2025-05-07T19:43:00.1295563Z cpu cores : 24 2025-05-07T19:43:00.1295783Z apicid : 94 2025-05-07T19:43:00.1296009Z initial apicid : 94 2025-05-07T19:43:00.1296221Z fpu : yes 2025-05-07T19:43:00.1296434Z fpu_exception : yes 2025-05-07T19:43:00.1296656Z cpuid level : 13 2025-05-07T19:43:00.1296880Z wp : yes 2025-05-07T19:43:00.1299463Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1301981Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1302544Z bogomips : 5999.98 2025-05-07T19:43:00.1302748Z clflush size : 64 2025-05-07T19:43:00.1302960Z cache_alignment : 64 2025-05-07T19:43:00.1303285Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1303609Z power management: 2025-05-07T19:43:00.1303731Z 2025-05-07T19:43:00.1303834Z processor : 40 2025-05-07T19:43:00.1304030Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1304281Z cpu family : 6 2025-05-07T19:43:00.1304469Z model : 85 2025-05-07T19:43:00.1304743Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1305071Z stepping : 7 2025-05-07T19:43:00.1305285Z microcode : 0x5003901 2025-05-07T19:43:00.1305499Z cpu MHz : 2999.994 2025-05-07T19:43:00.1305716Z cache size : 36608 KB 2025-05-07T19:43:00.1305918Z physical id : 1 2025-05-07T19:43:00.1306133Z siblings : 48 2025-05-07T19:43:00.1306325Z core id : 16 2025-05-07T19:43:00.1306529Z cpu cores : 24 2025-05-07T19:43:00.1306744Z apicid : 96 2025-05-07T19:43:00.1306925Z initial apicid : 96 2025-05-07T19:43:00.1307143Z fpu : yes 2025-05-07T19:43:00.1307318Z fpu_exception : yes 2025-05-07T19:43:00.1307532Z cpuid level : 13 2025-05-07T19:43:00.1307725Z wp : yes 2025-05-07T19:43:00.1309896Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1312424Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1313215Z bogomips : 5999.98 2025-05-07T19:43:00.1313460Z clflush size : 64 2025-05-07T19:43:00.1313682Z cache_alignment : 64 2025-05-07T19:43:00.1313972Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1314303Z power management: 2025-05-07T19:43:00.1314455Z 2025-05-07T19:43:00.1314546Z processor : 41 2025-05-07T19:43:00.1314787Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1315024Z cpu family : 6 2025-05-07T19:43:00.1315244Z model : 85 2025-05-07T19:43:00.1315520Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1315897Z stepping : 7 2025-05-07T19:43:00.1316108Z microcode : 0x5003901 2025-05-07T19:43:00.1316366Z cpu MHz : 2999.994 2025-05-07T19:43:00.1316585Z cache size : 36608 KB 2025-05-07T19:43:00.1316842Z physical id : 1 2025-05-07T19:43:00.1317059Z siblings : 48 2025-05-07T19:43:00.1317288Z core id : 17 2025-05-07T19:43:00.1317505Z cpu cores : 24 2025-05-07T19:43:00.1317712Z apicid : 98 2025-05-07T19:43:00.1317951Z initial apicid : 98 2025-05-07T19:43:00.1318177Z fpu : yes 2025-05-07T19:43:00.1318398Z fpu_exception : yes 2025-05-07T19:43:00.1318607Z cpuid level : 13 2025-05-07T19:43:00.1318837Z wp : yes 2025-05-07T19:43:00.1321182Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1323972Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1324582Z bogomips : 5999.98 2025-05-07T19:43:00.1324808Z clflush size : 64 2025-05-07T19:43:00.1325064Z cache_alignment : 64 2025-05-07T19:43:00.1325444Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1325756Z power management: 2025-05-07T19:43:00.1325922Z 2025-05-07T19:43:00.1326006Z processor : 42 2025-05-07T19:43:00.1326216Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1326475Z cpu family : 6 2025-05-07T19:43:00.1326668Z model : 85 2025-05-07T19:43:00.1326962Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1327287Z stepping : 7 2025-05-07T19:43:00.1327507Z microcode : 0x5003901 2025-05-07T19:43:00.1327722Z cpu MHz : 2999.994 2025-05-07T19:43:00.1327947Z cache size : 36608 KB 2025-05-07T19:43:00.1328166Z physical id : 1 2025-05-07T19:43:00.1328382Z siblings : 48 2025-05-07T19:43:00.1328580Z core id : 18 2025-05-07T19:43:00.1328793Z cpu cores : 24 2025-05-07T19:43:00.1329004Z apicid : 100 2025-05-07T19:43:00.1329202Z initial apicid : 100 2025-05-07T19:43:00.1329444Z fpu : yes 2025-05-07T19:43:00.1329640Z fpu_exception : yes 2025-05-07T19:43:00.1329872Z cpuid level : 13 2025-05-07T19:43:00.1330080Z wp : yes 2025-05-07T19:43:00.1332276Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1334816Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1335366Z bogomips : 5999.98 2025-05-07T19:43:00.1335599Z clflush size : 64 2025-05-07T19:43:00.1335796Z cache_alignment : 64 2025-05-07T19:43:00.1336083Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1336388Z power management: 2025-05-07T19:43:00.1336547Z 2025-05-07T19:43:00.1336633Z processor : 43 2025-05-07T19:43:00.1336856Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1337083Z cpu family : 6 2025-05-07T19:43:00.1337294Z model : 85 2025-05-07T19:43:00.1337559Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1337917Z stepping : 7 2025-05-07T19:43:00.1338112Z microcode : 0x5003901 2025-05-07T19:43:00.1338343Z cpu MHz : 2999.994 2025-05-07T19:43:00.1338551Z cache size : 36608 KB 2025-05-07T19:43:00.1338775Z physical id : 1 2025-05-07T19:43:00.1338982Z siblings : 48 2025-05-07T19:43:00.1339196Z core id : 19 2025-05-07T19:43:00.1339402Z cpu cores : 24 2025-05-07T19:43:00.1339595Z apicid : 102 2025-05-07T19:43:00.1339815Z initial apicid : 102 2025-05-07T19:43:00.1340032Z fpu : yes 2025-05-07T19:43:00.1340240Z fpu_exception : yes 2025-05-07T19:43:00.1340454Z cpuid level : 13 2025-05-07T19:43:00.1340676Z wp : yes 2025-05-07T19:43:00.1342845Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1345442Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1346034Z bogomips : 5999.98 2025-05-07T19:43:00.1346243Z clflush size : 64 2025-05-07T19:43:00.1346483Z cache_alignment : 64 2025-05-07T19:43:00.1346744Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1347080Z power management: 2025-05-07T19:43:00.1347208Z 2025-05-07T19:43:00.1347315Z processor : 44 2025-05-07T19:43:00.1347568Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1347823Z cpu family : 6 2025-05-07T19:43:00.1348020Z model : 85 2025-05-07T19:43:00.1348312Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1348644Z stepping : 7 2025-05-07T19:43:00.1348864Z microcode : 0x5003901 2025-05-07T19:43:00.1349084Z cpu MHz : 2999.994 2025-05-07T19:43:00.1349313Z cache size : 36608 KB 2025-05-07T19:43:00.1349534Z physical id : 1 2025-05-07T19:43:00.1349757Z siblings : 48 2025-05-07T19:43:00.1349952Z core id : 20 2025-05-07T19:43:00.1350151Z cpu cores : 24 2025-05-07T19:43:00.1350361Z apicid : 104 2025-05-07T19:43:00.1350559Z initial apicid : 104 2025-05-07T19:43:00.1350800Z fpu : yes 2025-05-07T19:43:00.1350994Z fpu_exception : yes 2025-05-07T19:43:00.1351219Z cpuid level : 13 2025-05-07T19:43:00.1351422Z wp : yes 2025-05-07T19:43:00.1353910Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1356665Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1357267Z bogomips : 5999.98 2025-05-07T19:43:00.1357517Z clflush size : 64 2025-05-07T19:43:00.1357738Z cache_alignment : 64 2025-05-07T19:43:00.1358051Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1358408Z power management: 2025-05-07T19:43:00.1358550Z 2025-05-07T19:43:00.1358644Z processor : 45 2025-05-07T19:43:00.1358883Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1359127Z cpu family : 6 2025-05-07T19:43:00.1359356Z model : 85 2025-05-07T19:43:00.1359639Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1360026Z stepping : 7 2025-05-07T19:43:00.1360238Z microcode : 0x5003901 2025-05-07T19:43:00.1360492Z cpu MHz : 2999.994 2025-05-07T19:43:00.1360725Z cache size : 36608 KB 2025-05-07T19:43:00.1360977Z physical id : 1 2025-05-07T19:43:00.1361196Z siblings : 48 2025-05-07T19:43:00.1361420Z core id : 21 2025-05-07T19:43:00.1361635Z cpu cores : 24 2025-05-07T19:43:00.1361833Z apicid : 106 2025-05-07T19:43:00.1362075Z initial apicid : 106 2025-05-07T19:43:00.1362295Z fpu : yes 2025-05-07T19:43:00.1362515Z fpu_exception : yes 2025-05-07T19:43:00.1362736Z cpuid level : 13 2025-05-07T19:43:00.1362976Z wp : yes 2025-05-07T19:43:00.1365427Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1368022Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1368601Z bogomips : 5999.98 2025-05-07T19:43:00.1368811Z clflush size : 64 2025-05-07T19:43:00.1369047Z cache_alignment : 64 2025-05-07T19:43:00.1369309Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1369640Z power management: 2025-05-07T19:43:00.1369770Z 2025-05-07T19:43:00.1369869Z processor : 46 2025-05-07T19:43:00.1370079Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1370326Z cpu family : 6 2025-05-07T19:43:00.1370512Z model : 85 2025-05-07T19:43:00.1370850Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1371178Z stepping : 7 2025-05-07T19:43:00.1371391Z microcode : 0x5003901 2025-05-07T19:43:00.1371597Z cpu MHz : 2999.994 2025-05-07T19:43:00.1371816Z cache size : 36608 KB 2025-05-07T19:43:00.1372031Z physical id : 1 2025-05-07T19:43:00.1372243Z siblings : 48 2025-05-07T19:43:00.1372438Z core id : 22 2025-05-07T19:43:00.1372654Z cpu cores : 24 2025-05-07T19:43:00.1372879Z apicid : 108 2025-05-07T19:43:00.1373075Z initial apicid : 108 2025-05-07T19:43:00.1373305Z fpu : yes 2025-05-07T19:43:00.1373488Z fpu_exception : yes 2025-05-07T19:43:00.1373729Z cpuid level : 13 2025-05-07T19:43:00.1373930Z wp : yes 2025-05-07T19:43:00.1376116Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1378639Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1379203Z bogomips : 5999.98 2025-05-07T19:43:00.1379437Z clflush size : 64 2025-05-07T19:43:00.1379644Z cache_alignment : 64 2025-05-07T19:43:00.1379928Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1380263Z power management: 2025-05-07T19:43:00.1380392Z 2025-05-07T19:43:00.1380480Z processor : 47 2025-05-07T19:43:00.1380701Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1380921Z cpu family : 6 2025-05-07T19:43:00.1381134Z model : 85 2025-05-07T19:43:00.1381394Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1381748Z stepping : 7 2025-05-07T19:43:00.1381944Z microcode : 0x5003901 2025-05-07T19:43:00.1382177Z cpu MHz : 2999.994 2025-05-07T19:43:00.1382385Z cache size : 36608 KB 2025-05-07T19:43:00.1382615Z physical id : 1 2025-05-07T19:43:00.1382813Z siblings : 48 2025-05-07T19:43:00.1383030Z core id : 23 2025-05-07T19:43:00.1383242Z cpu cores : 24 2025-05-07T19:43:00.1383427Z apicid : 110 2025-05-07T19:43:00.1383642Z initial apicid : 110 2025-05-07T19:43:00.1383849Z fpu : yes 2025-05-07T19:43:00.1384064Z fpu_exception : yes 2025-05-07T19:43:00.1384261Z cpuid level : 13 2025-05-07T19:43:00.1384480Z wp : yes 2025-05-07T19:43:00.1386979Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1389821Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1390440Z bogomips : 5999.98 2025-05-07T19:43:00.1390659Z clflush size : 64 2025-05-07T19:43:00.1390915Z cache_alignment : 64 2025-05-07T19:43:00.1391196Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1391554Z power management: 2025-05-07T19:43:00.1391689Z 2025-05-07T19:43:00.1391802Z processor : 48 2025-05-07T19:43:00.1392026Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1392294Z cpu family : 6 2025-05-07T19:43:00.1392558Z model : 85 2025-05-07T19:43:00.1392879Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1393307Z stepping : 7 2025-05-07T19:43:00.1393553Z microcode : 0x5003901 2025-05-07T19:43:00.1393787Z cpu MHz : 1200.723 2025-05-07T19:43:00.1394027Z cache size : 36608 KB 2025-05-07T19:43:00.1394256Z physical id : 0 2025-05-07T19:43:00.1394491Z siblings : 48 2025-05-07T19:43:00.1394718Z core id : 0 2025-05-07T19:43:00.1394923Z cpu cores : 24 2025-05-07T19:43:00.1395159Z apicid : 1 2025-05-07T19:43:00.1395350Z initial apicid : 1 2025-05-07T19:43:00.1395580Z fpu : yes 2025-05-07T19:43:00.1395773Z fpu_exception : yes 2025-05-07T19:43:00.1396004Z cpuid level : 13 2025-05-07T19:43:00.1396203Z wp : yes 2025-05-07T19:43:00.1398580Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1401327Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1401927Z bogomips : 5999.98 2025-05-07T19:43:00.1402180Z clflush size : 64 2025-05-07T19:43:00.1402420Z cache_alignment : 64 2025-05-07T19:43:00.1402747Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1403133Z power management: 2025-05-07T19:43:00.1403280Z 2025-05-07T19:43:00.1403380Z processor : 49 2025-05-07T19:43:00.1403658Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1403928Z cpu family : 6 2025-05-07T19:43:00.1404193Z model : 85 2025-05-07T19:43:00.1404603Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1404994Z stepping : 7 2025-05-07T19:43:00.1405224Z microcode : 0x5003901 2025-05-07T19:43:00.1405497Z cpu MHz : 1201.883 2025-05-07T19:43:00.1405735Z cache size : 36608 KB 2025-05-07T19:43:00.1406009Z physical id : 0 2025-05-07T19:43:00.1406233Z siblings : 48 2025-05-07T19:43:00.1406487Z core id : 1 2025-05-07T19:43:00.1406735Z cpu cores : 24 2025-05-07T19:43:00.1406950Z apicid : 3 2025-05-07T19:43:00.1407200Z initial apicid : 3 2025-05-07T19:43:00.1407436Z fpu : yes 2025-05-07T19:43:00.1407681Z fpu_exception : yes 2025-05-07T19:43:00.1407915Z cpuid level : 13 2025-05-07T19:43:00.1408166Z wp : yes 2025-05-07T19:43:00.1410359Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1412975Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1413586Z bogomips : 5999.98 2025-05-07T19:43:00.1413808Z clflush size : 64 2025-05-07T19:43:00.1414067Z cache_alignment : 64 2025-05-07T19:43:00.1414342Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1414695Z power management: 2025-05-07T19:43:00.1414828Z 2025-05-07T19:43:00.1414946Z processor : 50 2025-05-07T19:43:00.1415172Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1415440Z cpu family : 6 2025-05-07T19:43:00.1415652Z model : 85 2025-05-07T19:43:00.1415952Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1416299Z stepping : 7 2025-05-07T19:43:00.1416545Z microcode : 0x5003901 2025-05-07T19:43:00.1416824Z cpu MHz : 2999.994 2025-05-07T19:43:00.1417087Z cache size : 36608 KB 2025-05-07T19:43:00.1417330Z physical id : 0 2025-05-07T19:43:00.1417585Z siblings : 48 2025-05-07T19:43:00.1417829Z core id : 2 2025-05-07T19:43:00.1418044Z cpu cores : 24 2025-05-07T19:43:00.1418281Z apicid : 5 2025-05-07T19:43:00.1418494Z initial apicid : 5 2025-05-07T19:43:00.1418746Z fpu : yes 2025-05-07T19:43:00.1418955Z fpu_exception : yes 2025-05-07T19:43:00.1419210Z cpuid level : 13 2025-05-07T19:43:00.1419428Z wp : yes 2025-05-07T19:43:00.1421637Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1424197Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1424777Z bogomips : 5999.98 2025-05-07T19:43:00.1425033Z clflush size : 64 2025-05-07T19:43:00.1425261Z cache_alignment : 64 2025-05-07T19:43:00.1425567Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1425917Z power management: 2025-05-07T19:43:00.1426052Z 2025-05-07T19:43:00.1426144Z processor : 51 2025-05-07T19:43:00.1426391Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1426635Z cpu family : 6 2025-05-07T19:43:00.1426868Z model : 85 2025-05-07T19:43:00.1427141Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1427515Z stepping : 7 2025-05-07T19:43:00.1427732Z microcode : 0x5003901 2025-05-07T19:43:00.1427988Z cpu MHz : 1204.950 2025-05-07T19:43:00.1428212Z cache size : 36608 KB 2025-05-07T19:43:00.1428470Z physical id : 0 2025-05-07T19:43:00.1428690Z siblings : 48 2025-05-07T19:43:00.1428924Z core id : 3 2025-05-07T19:43:00.1429159Z cpu cores : 24 2025-05-07T19:43:00.1429371Z apicid : 7 2025-05-07T19:43:00.1429607Z initial apicid : 7 2025-05-07T19:43:00.1429830Z fpu : yes 2025-05-07T19:43:00.1430061Z fpu_exception : yes 2025-05-07T19:43:00.1430286Z cpuid level : 13 2025-05-07T19:43:00.1430525Z wp : yes 2025-05-07T19:43:00.1432781Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1435767Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1436412Z bogomips : 5999.98 2025-05-07T19:43:00.1436652Z clflush size : 64 2025-05-07T19:43:00.1436923Z cache_alignment : 64 2025-05-07T19:43:00.1437216Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1437590Z power management: 2025-05-07T19:43:00.1437734Z 2025-05-07T19:43:00.1437857Z processor : 52 2025-05-07T19:43:00.1438103Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1438393Z cpu family : 6 2025-05-07T19:43:00.1438619Z model : 85 2025-05-07T19:43:00.1438948Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1439324Z stepping : 7 2025-05-07T19:43:00.1439579Z microcode : 0x5003901 2025-05-07T19:43:00.1439833Z cpu MHz : 2999.994 2025-05-07T19:43:00.1440108Z cache size : 36608 KB 2025-05-07T19:43:00.1440418Z physical id : 0 2025-05-07T19:43:00.1440682Z siblings : 48 2025-05-07T19:43:00.1440935Z core id : 4 2025-05-07T19:43:00.1441157Z cpu cores : 24 2025-05-07T19:43:00.1441416Z apicid : 9 2025-05-07T19:43:00.1441637Z initial apicid : 9 2025-05-07T19:43:00.1441901Z fpu : yes 2025-05-07T19:43:00.1442120Z fpu_exception : yes 2025-05-07T19:43:00.1442382Z cpuid level : 13 2025-05-07T19:43:00.1442615Z wp : yes 2025-05-07T19:43:00.1445017Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1447696Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1448279Z bogomips : 5999.98 2025-05-07T19:43:00.1448543Z clflush size : 64 2025-05-07T19:43:00.1448772Z cache_alignment : 64 2025-05-07T19:43:00.1449087Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1449456Z power management: 2025-05-07T19:43:00.1449598Z 2025-05-07T19:43:00.1449689Z processor : 53 2025-05-07T19:43:00.1449958Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1450214Z cpu family : 6 2025-05-07T19:43:00.1450468Z model : 85 2025-05-07T19:43:00.1450757Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1451146Z stepping : 7 2025-05-07T19:43:00.1451381Z microcode : 0x5003901 2025-05-07T19:43:00.1451655Z cpu MHz : 1202.559 2025-05-07T19:43:00.1451974Z cache size : 36608 KB 2025-05-07T19:43:00.1452242Z physical id : 0 2025-05-07T19:43:00.1452465Z siblings : 48 2025-05-07T19:43:00.1452713Z core id : 5 2025-05-07T19:43:00.1452964Z cpu cores : 24 2025-05-07T19:43:00.1453190Z apicid : 11 2025-05-07T19:43:00.1453443Z initial apicid : 11 2025-05-07T19:43:00.1453660Z fpu : yes 2025-05-07T19:43:00.1453914Z fpu_exception : yes 2025-05-07T19:43:00.1454147Z cpuid level : 13 2025-05-07T19:43:00.1454390Z wp : yes 2025-05-07T19:43:00.1456580Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1459146Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1459797Z bogomips : 5999.98 2025-05-07T19:43:00.1460017Z clflush size : 64 2025-05-07T19:43:00.1460273Z cache_alignment : 64 2025-05-07T19:43:00.1475162Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1475306Z power management: 2025-05-07T19:43:00.1475313Z 2025-05-07T19:43:00.1475418Z processor : 54 2025-05-07T19:43:00.1475519Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1475602Z cpu family : 6 2025-05-07T19:43:00.1475696Z model : 85 2025-05-07T19:43:00.1475871Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1475956Z stepping : 7 2025-05-07T19:43:00.1476042Z microcode : 0x5003901 2025-05-07T19:43:00.1476150Z cpu MHz : 2999.994 2025-05-07T19:43:00.1476240Z cache size : 36608 KB 2025-05-07T19:43:00.1476327Z physical id : 0 2025-05-07T19:43:00.1476424Z siblings : 48 2025-05-07T19:43:00.1476509Z core id : 6 2025-05-07T19:43:00.1476737Z cpu cores : 24 2025-05-07T19:43:00.1476823Z apicid : 13 2025-05-07T19:43:00.1476944Z initial apicid : 13 2025-05-07T19:43:00.1477027Z fpu : yes 2025-05-07T19:43:00.1477115Z fpu_exception : yes 2025-05-07T19:43:00.1477204Z cpuid level : 13 2025-05-07T19:43:00.1477308Z wp : yes 2025-05-07T19:43:00.1479559Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1479981Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1480071Z bogomips : 5999.98 2025-05-07T19:43:00.1480168Z clflush size : 64 2025-05-07T19:43:00.1480265Z cache_alignment : 64 2025-05-07T19:43:00.1480422Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1480512Z power management: 2025-05-07T19:43:00.1480517Z 2025-05-07T19:43:00.1480603Z processor : 55 2025-05-07T19:43:00.1480712Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1480790Z cpu family : 6 2025-05-07T19:43:00.1480870Z model : 85 2025-05-07T19:43:00.1481049Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1481127Z stepping : 7 2025-05-07T19:43:00.1481214Z microcode : 0x5003901 2025-05-07T19:43:00.1481298Z cpu MHz : 2999.994 2025-05-07T19:43:00.1481398Z cache size : 36608 KB 2025-05-07T19:43:00.1481483Z physical id : 0 2025-05-07T19:43:00.1481560Z siblings : 48 2025-05-07T19:43:00.1481642Z core id : 7 2025-05-07T19:43:00.1481739Z cpu cores : 24 2025-05-07T19:43:00.1481821Z apicid : 15 2025-05-07T19:43:00.1481912Z initial apicid : 15 2025-05-07T19:43:00.1482006Z fpu : yes 2025-05-07T19:43:00.1482088Z fpu_exception : yes 2025-05-07T19:43:00.1482179Z cpuid level : 13 2025-05-07T19:43:00.1482252Z wp : yes 2025-05-07T19:43:00.1484499Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1484899Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1484995Z bogomips : 5999.98 2025-05-07T19:43:00.1485132Z clflush size : 64 2025-05-07T19:43:00.1485228Z cache_alignment : 64 2025-05-07T19:43:00.1485360Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1485458Z power management: 2025-05-07T19:43:00.1485462Z 2025-05-07T19:43:00.1485544Z processor : 56 2025-05-07T19:43:00.1485632Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1485888Z cpu family : 6 2025-05-07T19:43:00.1485968Z model : 85 2025-05-07T19:43:00.1486138Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1486226Z stepping : 7 2025-05-07T19:43:00.1486323Z microcode : 0x5003901 2025-05-07T19:43:00.1486408Z cpu MHz : 1196.083 2025-05-07T19:43:00.1486485Z cache size : 36608 KB 2025-05-07T19:43:00.1486581Z physical id : 0 2025-05-07T19:43:00.1486658Z siblings : 48 2025-05-07T19:43:00.1486740Z core id : 8 2025-05-07T19:43:00.1486819Z cpu cores : 24 2025-05-07T19:43:00.1486915Z apicid : 17 2025-05-07T19:43:00.1487559Z initial apicid : 17 2025-05-07T19:43:00.1487644Z fpu : yes 2025-05-07T19:43:00.1487748Z fpu_exception : yes 2025-05-07T19:43:00.1487828Z cpuid level : 13 2025-05-07T19:43:00.1487901Z wp : yes 2025-05-07T19:43:00.1490149Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1490547Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1490627Z bogomips : 5999.98 2025-05-07T19:43:00.1490726Z clflush size : 64 2025-05-07T19:43:00.1490821Z cache_alignment : 64 2025-05-07T19:43:00.1490956Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1491038Z power management: 2025-05-07T19:43:00.1491043Z 2025-05-07T19:43:00.1491139Z processor : 57 2025-05-07T19:43:00.1491235Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1491312Z cpu family : 6 2025-05-07T19:43:00.1491401Z model : 85 2025-05-07T19:43:00.1491565Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1491652Z stepping : 7 2025-05-07T19:43:00.1491735Z microcode : 0x5003901 2025-05-07T19:43:00.1491838Z cpu MHz : 2999.994 2025-05-07T19:43:00.1491922Z cache size : 36608 KB 2025-05-07T19:43:00.1492007Z physical id : 0 2025-05-07T19:43:00.1492098Z siblings : 48 2025-05-07T19:43:00.1492176Z core id : 9 2025-05-07T19:43:00.1492253Z cpu cores : 24 2025-05-07T19:43:00.1492328Z apicid : 19 2025-05-07T19:43:00.1492419Z initial apicid : 19 2025-05-07T19:43:00.1492494Z fpu : yes 2025-05-07T19:43:00.1492584Z fpu_exception : yes 2025-05-07T19:43:00.1492675Z cpuid level : 13 2025-05-07T19:43:00.1492749Z wp : yes 2025-05-07T19:43:00.1494979Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1495387Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1495469Z bogomips : 5999.98 2025-05-07T19:43:00.1495555Z clflush size : 64 2025-05-07T19:43:00.1495657Z cache_alignment : 64 2025-05-07T19:43:00.1495858Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1495943Z power management: 2025-05-07T19:43:00.1495948Z 2025-05-07T19:43:00.1496028Z processor : 58 2025-05-07T19:43:00.1496123Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1496208Z cpu family : 6 2025-05-07T19:43:00.1496287Z model : 85 2025-05-07T19:43:00.1496466Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1496556Z stepping : 7 2025-05-07T19:43:00.1496646Z microcode : 0x5003901 2025-05-07T19:43:00.1496736Z cpu MHz : 1205.845 2025-05-07T19:43:00.1496835Z cache size : 36608 KB 2025-05-07T19:43:00.1496926Z physical id : 0 2025-05-07T19:43:00.1497010Z siblings : 48 2025-05-07T19:43:00.1497104Z core id : 10 2025-05-07T19:43:00.1497186Z cpu cores : 24 2025-05-07T19:43:00.1497268Z apicid : 21 2025-05-07T19:43:00.1497356Z initial apicid : 21 2025-05-07T19:43:00.1497443Z fpu : yes 2025-05-07T19:43:00.1497531Z fpu_exception : yes 2025-05-07T19:43:00.1497660Z cpuid level : 13 2025-05-07T19:43:00.1497858Z wp : yes 2025-05-07T19:43:00.1499928Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1500288Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1500382Z bogomips : 5999.98 2025-05-07T19:43:00.1500460Z clflush size : 64 2025-05-07T19:43:00.1500536Z cache_alignment : 64 2025-05-07T19:43:00.1500682Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1500764Z power management: 2025-05-07T19:43:00.1500769Z 2025-05-07T19:43:00.1500842Z processor : 59 2025-05-07T19:43:00.1500930Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1501023Z cpu family : 6 2025-05-07T19:43:00.1501094Z model : 85 2025-05-07T19:43:00.1501248Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1501341Z stepping : 7 2025-05-07T19:43:00.1501420Z microcode : 0x5003901 2025-05-07T19:43:00.1501500Z cpu MHz : 2999.994 2025-05-07T19:43:00.1501580Z cache size : 36608 KB 2025-05-07T19:43:00.1501680Z physical id : 0 2025-05-07T19:43:00.1501752Z siblings : 48 2025-05-07T19:43:00.1501824Z core id : 11 2025-05-07T19:43:00.1501916Z cpu cores : 24 2025-05-07T19:43:00.1501993Z apicid : 23 2025-05-07T19:43:00.1502069Z initial apicid : 23 2025-05-07T19:43:00.1502142Z fpu : yes 2025-05-07T19:43:00.1502236Z fpu_exception : yes 2025-05-07T19:43:00.1502304Z cpuid level : 13 2025-05-07T19:43:00.1502376Z wp : yes 2025-05-07T19:43:00.1504450Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1504812Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1504890Z bogomips : 5999.98 2025-05-07T19:43:00.1504982Z clflush size : 64 2025-05-07T19:43:00.1505067Z cache_alignment : 64 2025-05-07T19:43:00.1505187Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1505279Z power management: 2025-05-07T19:43:00.1505336Z 2025-05-07T19:43:00.1505414Z processor : 60 2025-05-07T19:43:00.1505496Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1505569Z cpu family : 6 2025-05-07T19:43:00.1505659Z model : 85 2025-05-07T19:43:00.1505813Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1505886Z stepping : 7 2025-05-07T19:43:00.1505981Z microcode : 0x5003901 2025-05-07T19:43:00.1506054Z cpu MHz : 2999.994 2025-05-07T19:43:00.1506131Z cache size : 36608 KB 2025-05-07T19:43:00.1506206Z physical id : 0 2025-05-07T19:43:00.1506288Z siblings : 48 2025-05-07T19:43:00.1506358Z core id : 12 2025-05-07T19:43:00.1506435Z cpu cores : 24 2025-05-07T19:43:00.1506506Z apicid : 25 2025-05-07T19:43:00.1506590Z initial apicid : 25 2025-05-07T19:43:00.1506665Z fpu : yes 2025-05-07T19:43:00.1506741Z fpu_exception : yes 2025-05-07T19:43:00.1506828Z cpuid level : 13 2025-05-07T19:43:00.1506900Z wp : yes 2025-05-07T19:43:00.1509011Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1509389Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1509468Z bogomips : 5999.98 2025-05-07T19:43:00.1509549Z clflush size : 64 2025-05-07T19:43:00.1509638Z cache_alignment : 64 2025-05-07T19:43:00.1509764Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1509846Z power management: 2025-05-07T19:43:00.1509850Z 2025-05-07T19:43:00.1509943Z processor : 61 2025-05-07T19:43:00.1510021Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1510100Z cpu family : 6 2025-05-07T19:43:00.1510171Z model : 85 2025-05-07T19:43:00.1510329Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1510406Z stepping : 7 2025-05-07T19:43:00.1510484Z microcode : 0x5003901 2025-05-07T19:43:00.1510578Z cpu MHz : 1204.462 2025-05-07T19:43:00.1510654Z cache size : 36608 KB 2025-05-07T19:43:00.1510731Z physical id : 0 2025-05-07T19:43:00.1510799Z siblings : 48 2025-05-07T19:43:00.1510890Z core id : 13 2025-05-07T19:43:00.1510963Z cpu cores : 24 2025-05-07T19:43:00.1511031Z apicid : 27 2025-05-07T19:43:00.1511108Z initial apicid : 27 2025-05-07T19:43:00.1511188Z fpu : yes 2025-05-07T19:43:00.1511271Z fpu_exception : yes 2025-05-07T19:43:00.1511348Z cpuid level : 13 2025-05-07T19:43:00.1511424Z wp : yes 2025-05-07T19:43:00.1513800Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1514201Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1514305Z bogomips : 5999.98 2025-05-07T19:43:00.1514390Z clflush size : 64 2025-05-07T19:43:00.1514478Z cache_alignment : 64 2025-05-07T19:43:00.1514630Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1514718Z power management: 2025-05-07T19:43:00.1514723Z 2025-05-07T19:43:00.1514809Z processor : 62 2025-05-07T19:43:00.1514904Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1515060Z cpu family : 6 2025-05-07T19:43:00.1515137Z model : 85 2025-05-07T19:43:00.1515293Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1515393Z stepping : 7 2025-05-07T19:43:00.1515480Z microcode : 0x5003901 2025-05-07T19:43:00.1515561Z cpu MHz : 1199.356 2025-05-07T19:43:00.1515645Z cache size : 36608 KB 2025-05-07T19:43:00.1515748Z physical id : 0 2025-05-07T19:43:00.1515822Z siblings : 48 2025-05-07T19:43:00.1515901Z core id : 14 2025-05-07T19:43:00.1515996Z cpu cores : 24 2025-05-07T19:43:00.1516075Z apicid : 29 2025-05-07T19:43:00.1516160Z initial apicid : 29 2025-05-07T19:43:00.1516238Z fpu : yes 2025-05-07T19:43:00.1516337Z fpu_exception : yes 2025-05-07T19:43:00.1516415Z cpuid level : 13 2025-05-07T19:43:00.1516487Z wp : yes 2025-05-07T19:43:00.1518784Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1519185Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1519266Z bogomips : 5999.98 2025-05-07T19:43:00.1519357Z clflush size : 64 2025-05-07T19:43:00.1519446Z cache_alignment : 64 2025-05-07T19:43:00.1519577Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1519674Z power management: 2025-05-07T19:43:00.1519678Z 2025-05-07T19:43:00.1519761Z processor : 63 2025-05-07T19:43:00.1519854Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1519928Z cpu family : 6 2025-05-07T19:43:00.1520022Z model : 85 2025-05-07T19:43:00.1520182Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1520262Z stepping : 7 2025-05-07T19:43:00.1520364Z microcode : 0x5003901 2025-05-07T19:43:00.1520444Z cpu MHz : 1199.184 2025-05-07T19:43:00.1520533Z cache size : 36608 KB 2025-05-07T19:43:00.1520616Z physical id : 0 2025-05-07T19:43:00.1520707Z siblings : 48 2025-05-07T19:43:00.1520781Z core id : 15 2025-05-07T19:43:00.1520864Z cpu cores : 24 2025-05-07T19:43:00.1520952Z apicid : 31 2025-05-07T19:43:00.1521035Z initial apicid : 31 2025-05-07T19:43:00.1521113Z fpu : yes 2025-05-07T19:43:00.1521204Z fpu_exception : yes 2025-05-07T19:43:00.1521303Z cpuid level : 13 2025-05-07T19:43:00.1521381Z wp : yes 2025-05-07T19:43:00.1523609Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1524019Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1524105Z bogomips : 5999.98 2025-05-07T19:43:00.1524183Z clflush size : 64 2025-05-07T19:43:00.1524274Z cache_alignment : 64 2025-05-07T19:43:00.1524406Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1524490Z power management: 2025-05-07T19:43:00.1524494Z 2025-05-07T19:43:00.1524578Z processor : 64 2025-05-07T19:43:00.1524670Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1524753Z cpu family : 6 2025-05-07T19:43:00.1524831Z model : 85 2025-05-07T19:43:00.1525048Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1525242Z stepping : 7 2025-05-07T19:43:00.1525324Z microcode : 0x5003901 2025-05-07T19:43:00.1525411Z cpu MHz : 2999.994 2025-05-07T19:43:00.1525487Z cache size : 36608 KB 2025-05-07T19:43:00.1525564Z physical id : 0 2025-05-07T19:43:00.1525635Z siblings : 48 2025-05-07T19:43:00.1525713Z core id : 16 2025-05-07T19:43:00.1525780Z cpu cores : 24 2025-05-07T19:43:00.1525853Z apicid : 33 2025-05-07T19:43:00.1525939Z initial apicid : 33 2025-05-07T19:43:00.1526011Z fpu : yes 2025-05-07T19:43:00.1526094Z fpu_exception : yes 2025-05-07T19:43:00.1526172Z cpuid level : 13 2025-05-07T19:43:00.1526256Z wp : yes 2025-05-07T19:43:00.1528359Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1528743Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1528821Z bogomips : 5999.98 2025-05-07T19:43:00.1528894Z clflush size : 64 2025-05-07T19:43:00.1528972Z cache_alignment : 64 2025-05-07T19:43:00.1529098Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1529172Z power management: 2025-05-07T19:43:00.1529176Z 2025-05-07T19:43:00.1529249Z processor : 65 2025-05-07T19:43:00.1529340Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1529416Z cpu family : 6 2025-05-07T19:43:00.1529487Z model : 85 2025-05-07T19:43:00.1529632Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1529719Z stepping : 7 2025-05-07T19:43:00.1529804Z microcode : 0x5003901 2025-05-07T19:43:00.1529876Z cpu MHz : 1201.161 2025-05-07T19:43:00.1529956Z cache size : 36608 KB 2025-05-07T19:43:00.1530031Z physical id : 0 2025-05-07T19:43:00.1530102Z siblings : 48 2025-05-07T19:43:00.1530170Z core id : 17 2025-05-07T19:43:00.1530253Z cpu cores : 24 2025-05-07T19:43:00.1530325Z apicid : 35 2025-05-07T19:43:00.1530398Z initial apicid : 35 2025-05-07T19:43:00.1530492Z fpu : yes 2025-05-07T19:43:00.1530569Z fpu_exception : yes 2025-05-07T19:43:00.1530643Z cpuid level : 13 2025-05-07T19:43:00.1530717Z wp : yes 2025-05-07T19:43:00.1532785Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1533151Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1533248Z bogomips : 5999.98 2025-05-07T19:43:00.1533325Z clflush size : 64 2025-05-07T19:43:00.1533401Z cache_alignment : 64 2025-05-07T19:43:00.1533518Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1533603Z power management: 2025-05-07T19:43:00.1533608Z 2025-05-07T19:43:00.1533687Z processor : 66 2025-05-07T19:43:00.1533777Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1533875Z cpu family : 6 2025-05-07T19:43:00.1533948Z model : 85 2025-05-07T19:43:00.1534102Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1534230Z stepping : 7 2025-05-07T19:43:00.1534331Z microcode : 0x5003901 2025-05-07T19:43:00.1534411Z cpu MHz : 1200.133 2025-05-07T19:43:00.1534493Z cache size : 36608 KB 2025-05-07T19:43:00.1534600Z physical id : 0 2025-05-07T19:43:00.1534675Z siblings : 48 2025-05-07T19:43:00.1534750Z core id : 18 2025-05-07T19:43:00.1534833Z cpu cores : 24 2025-05-07T19:43:00.1534930Z apicid : 37 2025-05-07T19:43:00.1535010Z initial apicid : 37 2025-05-07T19:43:00.1535087Z fpu : yes 2025-05-07T19:43:00.1535184Z fpu_exception : yes 2025-05-07T19:43:00.1535262Z cpuid level : 13 2025-05-07T19:43:00.1535336Z wp : yes 2025-05-07T19:43:00.1537467Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1537842Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1537922Z bogomips : 5999.98 2025-05-07T19:43:00.1538018Z clflush size : 64 2025-05-07T19:43:00.1538105Z cache_alignment : 64 2025-05-07T19:43:00.1538229Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1538311Z power management: 2025-05-07T19:43:00.1538315Z 2025-05-07T19:43:00.1538411Z processor : 67 2025-05-07T19:43:00.1538499Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1538576Z cpu family : 6 2025-05-07T19:43:00.1538663Z model : 85 2025-05-07T19:43:00.1538821Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1538901Z stepping : 7 2025-05-07T19:43:00.1538985Z microcode : 0x5003901 2025-05-07T19:43:00.1539085Z cpu MHz : 2999.994 2025-05-07T19:43:00.1539174Z cache size : 36608 KB 2025-05-07T19:43:00.1539256Z physical id : 0 2025-05-07T19:43:00.1539345Z siblings : 48 2025-05-07T19:43:00.1539421Z core id : 19 2025-05-07T19:43:00.1539502Z cpu cores : 24 2025-05-07T19:43:00.1539582Z apicid : 39 2025-05-07T19:43:00.1539672Z initial apicid : 39 2025-05-07T19:43:00.1539743Z fpu : yes 2025-05-07T19:43:00.1539826Z fpu_exception : yes 2025-05-07T19:43:00.1539917Z cpuid level : 13 2025-05-07T19:43:00.1539990Z wp : yes 2025-05-07T19:43:00.1542057Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1542440Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1542518Z bogomips : 5999.98 2025-05-07T19:43:00.1542595Z clflush size : 64 2025-05-07T19:43:00.1542687Z cache_alignment : 64 2025-05-07T19:43:00.1542802Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1542877Z power management: 2025-05-07T19:43:00.1542882Z 2025-05-07T19:43:00.1542954Z processor : 68 2025-05-07T19:43:00.1543051Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1543123Z cpu family : 6 2025-05-07T19:43:00.1543191Z model : 85 2025-05-07T19:43:00.1543352Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1543423Z stepping : 7 2025-05-07T19:43:00.1543500Z microcode : 0x5003901 2025-05-07T19:43:00.1543622Z cpu MHz : 1200.066 2025-05-07T19:43:00.1543711Z cache size : 36608 KB 2025-05-07T19:43:00.1543794Z physical id : 0 2025-05-07T19:43:00.1543869Z siblings : 48 2025-05-07T19:43:00.1543962Z core id : 20 2025-05-07T19:43:00.1544037Z cpu cores : 24 2025-05-07T19:43:00.1544115Z apicid : 41 2025-05-07T19:43:00.1544195Z initial apicid : 41 2025-05-07T19:43:00.1544285Z fpu : yes 2025-05-07T19:43:00.1544366Z fpu_exception : yes 2025-05-07T19:43:00.1544440Z cpuid level : 13 2025-05-07T19:43:00.1544512Z wp : yes 2025-05-07T19:43:00.1546632Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1547001Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1547100Z bogomips : 5999.98 2025-05-07T19:43:00.1547183Z clflush size : 64 2025-05-07T19:43:00.1547265Z cache_alignment : 64 2025-05-07T19:43:00.1547404Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1547479Z power management: 2025-05-07T19:43:00.1547484Z 2025-05-07T19:43:00.1547562Z processor : 69 2025-05-07T19:43:00.1547641Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1547731Z cpu family : 6 2025-05-07T19:43:00.1547802Z model : 85 2025-05-07T19:43:00.1547951Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1548053Z stepping : 7 2025-05-07T19:43:00.1548128Z microcode : 0x5003901 2025-05-07T19:43:00.1548206Z cpu MHz : 2999.994 2025-05-07T19:43:00.1548281Z cache size : 36608 KB 2025-05-07T19:43:00.1548362Z physical id : 0 2025-05-07T19:43:00.1548437Z siblings : 48 2025-05-07T19:43:00.1548510Z core id : 21 2025-05-07T19:43:00.1548598Z cpu cores : 24 2025-05-07T19:43:00.1548675Z apicid : 43 2025-05-07T19:43:00.1548760Z initial apicid : 43 2025-05-07T19:43:00.1548836Z fpu : yes 2025-05-07T19:43:00.1548933Z fpu_exception : yes 2025-05-07T19:43:00.1549012Z cpuid level : 13 2025-05-07T19:43:00.1549084Z wp : yes 2025-05-07T19:43:00.1551161Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1551532Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1551612Z bogomips : 5999.98 2025-05-07T19:43:00.1551722Z clflush size : 64 2025-05-07T19:43:00.1551810Z cache_alignment : 64 2025-05-07T19:43:00.1551933Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1552012Z power management: 2025-05-07T19:43:00.1552032Z 2025-05-07T19:43:00.1552114Z processor : 70 2025-05-07T19:43:00.1552195Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1552272Z cpu family : 6 2025-05-07T19:43:00.1552366Z model : 85 2025-05-07T19:43:00.1552596Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1552678Z stepping : 7 2025-05-07T19:43:00.1552779Z microcode : 0x5003901 2025-05-07T19:43:00.1553031Z cpu MHz : 2999.994 2025-05-07T19:43:00.1553129Z cache size : 36608 KB 2025-05-07T19:43:00.1553284Z physical id : 0 2025-05-07T19:43:00.1553387Z siblings : 48 2025-05-07T19:43:00.1553466Z core id : 22 2025-05-07T19:43:00.1553613Z cpu cores : 24 2025-05-07T19:43:00.1553695Z apicid : 45 2025-05-07T19:43:00.1553807Z initial apicid : 45 2025-05-07T19:43:00.1553891Z fpu : yes 2025-05-07T19:43:00.1553983Z fpu_exception : yes 2025-05-07T19:43:00.1554082Z cpuid level : 13 2025-05-07T19:43:00.1554171Z wp : yes 2025-05-07T19:43:00.1556461Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1556873Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1556964Z bogomips : 5999.98 2025-05-07T19:43:00.1557049Z clflush size : 64 2025-05-07T19:43:00.1557146Z cache_alignment : 64 2025-05-07T19:43:00.1557277Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1557368Z power management: 2025-05-07T19:43:00.1557373Z 2025-05-07T19:43:00.1557466Z processor : 71 2025-05-07T19:43:00.1557558Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1557643Z cpu family : 6 2025-05-07T19:43:00.1557722Z model : 85 2025-05-07T19:43:00.1557894Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1557981Z stepping : 7 2025-05-07T19:43:00.1558067Z microcode : 0x5003901 2025-05-07T19:43:00.1558150Z cpu MHz : 2999.994 2025-05-07T19:43:00.1558246Z cache size : 36608 KB 2025-05-07T19:43:00.1558332Z physical id : 0 2025-05-07T19:43:00.1558412Z siblings : 48 2025-05-07T19:43:00.1558515Z core id : 23 2025-05-07T19:43:00.1558596Z cpu cores : 24 2025-05-07T19:43:00.1558681Z apicid : 47 2025-05-07T19:43:00.1558762Z initial apicid : 47 2025-05-07T19:43:00.1558861Z fpu : yes 2025-05-07T19:43:00.1558950Z fpu_exception : yes 2025-05-07T19:43:00.1559029Z cpuid level : 13 2025-05-07T19:43:00.1559110Z wp : yes 2025-05-07T19:43:00.1561337Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1561731Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1561826Z bogomips : 5999.98 2025-05-07T19:43:00.1561913Z clflush size : 64 2025-05-07T19:43:00.1561995Z cache_alignment : 64 2025-05-07T19:43:00.1562137Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1562225Z power management: 2025-05-07T19:43:00.1562230Z 2025-05-07T19:43:00.1562308Z processor : 72 2025-05-07T19:43:00.1562394Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1562486Z cpu family : 6 2025-05-07T19:43:00.1562564Z model : 85 2025-05-07T19:43:00.1562723Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1562824Z stepping : 7 2025-05-07T19:43:00.1562906Z microcode : 0x5003901 2025-05-07T19:43:00.1562983Z cpu MHz : 2999.994 2025-05-07T19:43:00.1563073Z cache size : 36608 KB 2025-05-07T19:43:00.1563180Z physical id : 1 2025-05-07T19:43:00.1563262Z siblings : 48 2025-05-07T19:43:00.1563339Z core id : 0 2025-05-07T19:43:00.1563480Z cpu cores : 24 2025-05-07T19:43:00.1563558Z apicid : 65 2025-05-07T19:43:00.1563642Z initial apicid : 65 2025-05-07T19:43:00.1563724Z fpu : yes 2025-05-07T19:43:00.1563827Z fpu_exception : yes 2025-05-07T19:43:00.1563915Z cpuid level : 13 2025-05-07T19:43:00.1563994Z wp : yes 2025-05-07T19:43:00.1566247Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1566659Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1566743Z bogomips : 5999.98 2025-05-07T19:43:00.1566843Z clflush size : 64 2025-05-07T19:43:00.1566925Z cache_alignment : 64 2025-05-07T19:43:00.1567050Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1567143Z power management: 2025-05-07T19:43:00.1567147Z 2025-05-07T19:43:00.1567223Z processor : 73 2025-05-07T19:43:00.1567307Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1567380Z cpu family : 6 2025-05-07T19:43:00.1567460Z model : 85 2025-05-07T19:43:00.1567605Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1567675Z stepping : 7 2025-05-07T19:43:00.1567758Z microcode : 0x5003901 2025-05-07T19:43:00.1567827Z cpu MHz : 1458.049 2025-05-07T19:43:00.1567906Z cache size : 36608 KB 2025-05-07T19:43:00.1567988Z physical id : 1 2025-05-07T19:43:00.1568072Z siblings : 48 2025-05-07T19:43:00.1568148Z core id : 1 2025-05-07T19:43:00.1568225Z cpu cores : 24 2025-05-07T19:43:00.1568311Z apicid : 67 2025-05-07T19:43:00.1568391Z initial apicid : 67 2025-05-07T19:43:00.1568466Z fpu : yes 2025-05-07T19:43:00.1568550Z fpu_exception : yes 2025-05-07T19:43:00.1568634Z cpuid level : 13 2025-05-07T19:43:00.1568712Z wp : yes 2025-05-07T19:43:00.1570769Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1571152Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1571234Z bogomips : 5999.98 2025-05-07T19:43:00.1571316Z clflush size : 64 2025-05-07T19:43:00.1571408Z cache_alignment : 64 2025-05-07T19:43:00.1571534Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1571610Z power management: 2025-05-07T19:43:00.1571615Z 2025-05-07T19:43:00.1571700Z processor : 74 2025-05-07T19:43:00.1571784Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1571862Z cpu family : 6 2025-05-07T19:43:00.1571935Z model : 85 2025-05-07T19:43:00.1572092Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1572171Z stepping : 7 2025-05-07T19:43:00.1572277Z microcode : 0x5003901 2025-05-07T19:43:00.1572366Z cpu MHz : 2965.764 2025-05-07T19:43:00.1572444Z cache size : 36608 KB 2025-05-07T19:43:00.1572519Z physical id : 1 2025-05-07T19:43:00.1572591Z siblings : 48 2025-05-07T19:43:00.1572675Z core id : 2 2025-05-07T19:43:00.1572750Z cpu cores : 24 2025-05-07T19:43:00.1572821Z apicid : 69 2025-05-07T19:43:00.1572955Z initial apicid : 69 2025-05-07T19:43:00.1573022Z fpu : yes 2025-05-07T19:43:00.1573101Z fpu_exception : yes 2025-05-07T19:43:00.1573177Z cpuid level : 13 2025-05-07T19:43:00.1573256Z wp : yes 2025-05-07T19:43:00.1575305Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1575729Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1575808Z bogomips : 5999.98 2025-05-07T19:43:00.1575884Z clflush size : 64 2025-05-07T19:43:00.1575962Z cache_alignment : 64 2025-05-07T19:43:00.1576091Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1576170Z power management: 2025-05-07T19:43:00.1576175Z 2025-05-07T19:43:00.1576249Z processor : 75 2025-05-07T19:43:00.1576337Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1576411Z cpu family : 6 2025-05-07T19:43:00.1576478Z model : 85 2025-05-07T19:43:00.1576625Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1576714Z stepping : 7 2025-05-07T19:43:00.1576794Z microcode : 0x5003901 2025-05-07T19:43:00.1576864Z cpu MHz : 3406.978 2025-05-07T19:43:00.1576948Z cache size : 36608 KB 2025-05-07T19:43:00.1577022Z physical id : 1 2025-05-07T19:43:00.1577098Z siblings : 48 2025-05-07T19:43:00.1577163Z core id : 3 2025-05-07T19:43:00.1577240Z cpu cores : 24 2025-05-07T19:43:00.1577310Z apicid : 71 2025-05-07T19:43:00.1577386Z initial apicid : 71 2025-05-07T19:43:00.1577456Z fpu : yes 2025-05-07T19:43:00.1577538Z fpu_exception : yes 2025-05-07T19:43:00.1577609Z cpuid level : 13 2025-05-07T19:43:00.1577678Z wp : yes 2025-05-07T19:43:00.1579741Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1580100Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1580182Z bogomips : 5999.98 2025-05-07T19:43:00.1580259Z clflush size : 64 2025-05-07T19:43:00.1580336Z cache_alignment : 64 2025-05-07T19:43:00.1580450Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1580555Z power management: 2025-05-07T19:43:00.1580559Z 2025-05-07T19:43:00.1580651Z processor : 76 2025-05-07T19:43:00.1580751Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1580861Z cpu family : 6 2025-05-07T19:43:00.1580949Z model : 85 2025-05-07T19:43:00.1581111Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1581203Z stepping : 7 2025-05-07T19:43:00.1581323Z microcode : 0x5003901 2025-05-07T19:43:00.1581415Z cpu MHz : 2706.913 2025-05-07T19:43:00.1581509Z cache size : 36608 KB 2025-05-07T19:43:00.1581642Z physical id : 1 2025-05-07T19:43:00.1581733Z siblings : 48 2025-05-07T19:43:00.1581826Z core id : 4 2025-05-07T19:43:00.1581916Z cpu cores : 24 2025-05-07T19:43:00.1582036Z apicid : 73 2025-05-07T19:43:00.1582132Z initial apicid : 73 2025-05-07T19:43:00.1582224Z fpu : yes 2025-05-07T19:43:00.1582324Z fpu_exception : yes 2025-05-07T19:43:00.1582502Z cpuid level : 13 2025-05-07T19:43:00.1582593Z wp : yes 2025-05-07T19:43:00.1584673Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1585092Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1585188Z bogomips : 5999.98 2025-05-07T19:43:00.1585339Z clflush size : 64 2025-05-07T19:43:00.1585460Z cache_alignment : 64 2025-05-07T19:43:00.1585602Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1586271Z power management: 2025-05-07T19:43:00.1586279Z 2025-05-07T19:43:00.1586420Z processor : 77 2025-05-07T19:43:00.1586529Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1586629Z cpu family : 6 2025-05-07T19:43:00.1586726Z model : 85 2025-05-07T19:43:00.1587021Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1587119Z stepping : 7 2025-05-07T19:43:00.1587222Z microcode : 0x5003901 2025-05-07T19:43:00.1587350Z cpu MHz : 2999.994 2025-05-07T19:43:00.1587451Z cache size : 36608 KB 2025-05-07T19:43:00.1587553Z physical id : 1 2025-05-07T19:43:00.1587653Z siblings : 48 2025-05-07T19:43:00.1587777Z core id : 5 2025-05-07T19:43:00.1587873Z cpu cores : 24 2025-05-07T19:43:00.1587967Z apicid : 75 2025-05-07T19:43:00.1588095Z initial apicid : 75 2025-05-07T19:43:00.1588192Z fpu : yes 2025-05-07T19:43:00.1588298Z fpu_exception : yes 2025-05-07T19:43:00.1588396Z cpuid level : 13 2025-05-07T19:43:00.1588494Z wp : yes 2025-05-07T19:43:00.1590730Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1591153Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1591236Z bogomips : 5999.98 2025-05-07T19:43:00.1591324Z clflush size : 64 2025-05-07T19:43:00.1591415Z cache_alignment : 64 2025-05-07T19:43:00.1591575Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1591675Z power management: 2025-05-07T19:43:00.1591680Z 2025-05-07T19:43:00.1591761Z processor : 78 2025-05-07T19:43:00.1591878Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1591960Z cpu family : 6 2025-05-07T19:43:00.1592049Z model : 85 2025-05-07T19:43:00.1592223Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1592322Z stepping : 7 2025-05-07T19:43:00.1592419Z microcode : 0x5003901 2025-05-07T19:43:00.1592580Z cpu MHz : 2999.994 2025-05-07T19:43:00.1592702Z cache size : 36608 KB 2025-05-07T19:43:00.1592792Z physical id : 1 2025-05-07T19:43:00.1592886Z siblings : 48 2025-05-07T19:43:00.1592978Z core id : 6 2025-05-07T19:43:00.1593082Z cpu cores : 24 2025-05-07T19:43:00.1593174Z apicid : 77 2025-05-07T19:43:00.1593266Z initial apicid : 77 2025-05-07T19:43:00.1593372Z fpu : yes 2025-05-07T19:43:00.1593468Z fpu_exception : yes 2025-05-07T19:43:00.1593560Z cpuid level : 13 2025-05-07T19:43:00.1593652Z wp : yes 2025-05-07T19:43:00.1596031Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1596439Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1596568Z bogomips : 5999.98 2025-05-07T19:43:00.1596662Z clflush size : 64 2025-05-07T19:43:00.1596754Z cache_alignment : 64 2025-05-07T19:43:00.1596964Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1597093Z power management: 2025-05-07T19:43:00.1597098Z 2025-05-07T19:43:00.1597194Z processor : 79 2025-05-07T19:43:00.1597290Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1597402Z cpu family : 6 2025-05-07T19:43:00.1597494Z model : 85 2025-05-07T19:43:00.1597661Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1597753Z stepping : 7 2025-05-07T19:43:00.1597857Z microcode : 0x5003901 2025-05-07T19:43:00.1597949Z cpu MHz : 2806.884 2025-05-07T19:43:00.1598034Z cache size : 36608 KB 2025-05-07T19:43:00.1598145Z physical id : 1 2025-05-07T19:43:00.1598224Z siblings : 48 2025-05-07T19:43:00.1598314Z core id : 7 2025-05-07T19:43:00.1598402Z cpu cores : 24 2025-05-07T19:43:00.1598518Z apicid : 79 2025-05-07T19:43:00.1598609Z initial apicid : 79 2025-05-07T19:43:00.1598697Z fpu : yes 2025-05-07T19:43:00.1598791Z fpu_exception : yes 2025-05-07T19:43:00.1598873Z cpuid level : 13 2025-05-07T19:43:00.1598951Z wp : yes 2025-05-07T19:43:00.1601187Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1601589Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1601680Z bogomips : 5999.98 2025-05-07T19:43:00.1601784Z clflush size : 64 2025-05-07T19:43:00.1601870Z cache_alignment : 64 2025-05-07T19:43:00.1601999Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1602087Z power management: 2025-05-07T19:43:00.1602094Z 2025-05-07T19:43:00.1602204Z processor : 80 2025-05-07T19:43:00.1602300Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1602391Z cpu family : 6 2025-05-07T19:43:00.1602507Z model : 85 2025-05-07T19:43:00.1602676Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1602765Z stepping : 7 2025-05-07T19:43:00.1602865Z microcode : 0x5003901 2025-05-07T19:43:00.1602987Z cpu MHz : 2892.143 2025-05-07T19:43:00.1603077Z cache size : 36608 KB 2025-05-07T19:43:00.1603172Z physical id : 1 2025-05-07T19:43:00.1603272Z siblings : 48 2025-05-07T19:43:00.1603365Z core id : 8 2025-05-07T19:43:00.1603445Z cpu cores : 24 2025-05-07T19:43:00.1603527Z apicid : 81 2025-05-07T19:43:00.1603630Z initial apicid : 81 2025-05-07T19:43:00.1603718Z fpu : yes 2025-05-07T19:43:00.1603814Z fpu_exception : yes 2025-05-07T19:43:00.1603915Z cpuid level : 13 2025-05-07T19:43:00.1604001Z wp : yes 2025-05-07T19:43:00.1606207Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1606637Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1606723Z bogomips : 5999.98 2025-05-07T19:43:00.1606798Z clflush size : 64 2025-05-07T19:43:00.1606905Z cache_alignment : 64 2025-05-07T19:43:00.1607034Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1607124Z power management: 2025-05-07T19:43:00.1607175Z 2025-05-07T19:43:00.1607262Z processor : 81 2025-05-07T19:43:00.1607368Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1607446Z cpu family : 6 2025-05-07T19:43:00.1607518Z model : 85 2025-05-07T19:43:00.1607680Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1607754Z stepping : 7 2025-05-07T19:43:00.1607832Z microcode : 0x5003901 2025-05-07T19:43:00.1607908Z cpu MHz : 2734.408 2025-05-07T19:43:00.1608005Z cache size : 36608 KB 2025-05-07T19:43:00.1608083Z physical id : 1 2025-05-07T19:43:00.1608155Z siblings : 48 2025-05-07T19:43:00.1608244Z core id : 9 2025-05-07T19:43:00.1608317Z cpu cores : 24 2025-05-07T19:43:00.1608391Z apicid : 83 2025-05-07T19:43:00.1608472Z initial apicid : 83 2025-05-07T19:43:00.1608556Z fpu : yes 2025-05-07T19:43:00.1608638Z fpu_exception : yes 2025-05-07T19:43:00.1608709Z cpuid level : 13 2025-05-07T19:43:00.1608782Z wp : yes 2025-05-07T19:43:00.1610851Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1611217Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1611306Z bogomips : 5999.98 2025-05-07T19:43:00.1611383Z clflush size : 64 2025-05-07T19:43:00.1611461Z cache_alignment : 64 2025-05-07T19:43:00.1611584Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1611673Z power management: 2025-05-07T19:43:00.1611678Z 2025-05-07T19:43:00.1611754Z processor : 82 2025-05-07T19:43:00.1611835Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1611926Z cpu family : 6 2025-05-07T19:43:00.1611998Z model : 85 2025-05-07T19:43:00.1612146Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1612232Z stepping : 7 2025-05-07T19:43:00.1612305Z microcode : 0x5003901 2025-05-07T19:43:00.1612383Z cpu MHz : 2999.994 2025-05-07T19:43:00.1612467Z cache size : 36608 KB 2025-05-07T19:43:00.1612555Z physical id : 1 2025-05-07T19:43:00.1612628Z siblings : 48 2025-05-07T19:43:00.1612699Z core id : 10 2025-05-07T19:43:00.1612773Z cpu cores : 24 2025-05-07T19:43:00.1612864Z apicid : 85 2025-05-07T19:43:00.1612942Z initial apicid : 85 2025-05-07T19:43:00.1613015Z fpu : yes 2025-05-07T19:43:00.1613102Z fpu_exception : yes 2025-05-07T19:43:00.1613172Z cpuid level : 13 2025-05-07T19:43:00.1613242Z wp : yes 2025-05-07T19:43:00.1615310Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1616052Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1616126Z bogomips : 5999.98 2025-05-07T19:43:00.1616214Z clflush size : 64 2025-05-07T19:43:00.1616293Z cache_alignment : 64 2025-05-07T19:43:00.1616411Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1616494Z power management: 2025-05-07T19:43:00.1616506Z 2025-05-07T19:43:00.1616584Z processor : 83 2025-05-07T19:43:00.1616719Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1616795Z cpu family : 6 2025-05-07T19:43:00.1616881Z model : 85 2025-05-07T19:43:00.1617026Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1617101Z stepping : 7 2025-05-07T19:43:00.1617193Z microcode : 0x5003901 2025-05-07T19:43:00.1617269Z cpu MHz : 2800.445 2025-05-07T19:43:00.1617355Z cache size : 36608 KB 2025-05-07T19:43:00.1617430Z physical id : 1 2025-05-07T19:43:00.1617507Z siblings : 48 2025-05-07T19:43:00.1617578Z core id : 11 2025-05-07T19:43:00.1617652Z cpu cores : 24 2025-05-07T19:43:00.1617723Z apicid : 87 2025-05-07T19:43:00.1617821Z initial apicid : 87 2025-05-07T19:43:00.1617896Z fpu : yes 2025-05-07T19:43:00.1617976Z fpu_exception : yes 2025-05-07T19:43:00.1618063Z cpuid level : 13 2025-05-07T19:43:00.1618134Z wp : yes 2025-05-07T19:43:00.1620197Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1620569Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1620649Z bogomips : 5999.98 2025-05-07T19:43:00.1620722Z clflush size : 64 2025-05-07T19:43:00.1620811Z cache_alignment : 64 2025-05-07T19:43:00.1620933Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1621011Z power management: 2025-05-07T19:43:00.1621015Z 2025-05-07T19:43:00.1621095Z processor : 84 2025-05-07T19:43:00.1621194Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1621274Z cpu family : 6 2025-05-07T19:43:00.1621343Z model : 85 2025-05-07T19:43:00.1621508Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1621583Z stepping : 7 2025-05-07T19:43:00.1621665Z microcode : 0x5003901 2025-05-07T19:43:00.1621735Z cpu MHz : 2999.994 2025-05-07T19:43:00.1621819Z cache size : 36608 KB 2025-05-07T19:43:00.1621893Z physical id : 1 2025-05-07T19:43:00.1621964Z siblings : 48 2025-05-07T19:43:00.1622052Z core id : 12 2025-05-07T19:43:00.1622125Z cpu cores : 24 2025-05-07T19:43:00.1622195Z apicid : 89 2025-05-07T19:43:00.1622270Z initial apicid : 89 2025-05-07T19:43:00.1622352Z fpu : yes 2025-05-07T19:43:00.1622430Z fpu_exception : yes 2025-05-07T19:43:00.1622510Z cpuid level : 13 2025-05-07T19:43:00.1622592Z wp : yes 2025-05-07T19:43:00.1624651Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1625061Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1625148Z bogomips : 5999.98 2025-05-07T19:43:00.1625226Z clflush size : 64 2025-05-07T19:43:00.1625308Z cache_alignment : 64 2025-05-07T19:43:00.1625438Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1625515Z power management: 2025-05-07T19:43:00.1625519Z 2025-05-07T19:43:00.1625599Z processor : 85 2025-05-07T19:43:00.1625684Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1625773Z cpu family : 6 2025-05-07T19:43:00.1625844Z model : 85 2025-05-07T19:43:00.1626056Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1626149Z stepping : 7 2025-05-07T19:43:00.1626229Z microcode : 0x5003901 2025-05-07T19:43:00.1626303Z cpu MHz : 2999.994 2025-05-07T19:43:00.1626383Z cache size : 36608 KB 2025-05-07T19:43:00.1626470Z physical id : 1 2025-05-07T19:43:00.1626546Z siblings : 48 2025-05-07T19:43:00.1626614Z core id : 13 2025-05-07T19:43:00.1626709Z cpu cores : 24 2025-05-07T19:43:00.1626781Z apicid : 91 2025-05-07T19:43:00.1626863Z initial apicid : 91 2025-05-07T19:43:00.1626939Z fpu : yes 2025-05-07T19:43:00.1627034Z fpu_exception : yes 2025-05-07T19:43:00.1627108Z cpuid level : 13 2025-05-07T19:43:00.1627178Z wp : yes 2025-05-07T19:43:00.1629248Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1629617Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1629692Z bogomips : 5999.98 2025-05-07T19:43:00.1629789Z clflush size : 64 2025-05-07T19:43:00.1629873Z cache_alignment : 64 2025-05-07T19:43:00.1630001Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1630093Z power management: 2025-05-07T19:43:00.1630098Z 2025-05-07T19:43:00.1630181Z processor : 86 2025-05-07T19:43:00.1630269Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1630346Z cpu family : 6 2025-05-07T19:43:00.1630430Z model : 85 2025-05-07T19:43:00.1630581Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1630650Z stepping : 7 2025-05-07T19:43:00.1630742Z microcode : 0x5003901 2025-05-07T19:43:00.1630820Z cpu MHz : 2743.200 2025-05-07T19:43:00.1630900Z cache size : 36608 KB 2025-05-07T19:43:00.1630977Z physical id : 1 2025-05-07T19:43:00.1631060Z siblings : 48 2025-05-07T19:43:00.1631133Z core id : 14 2025-05-07T19:43:00.1631204Z cpu cores : 24 2025-05-07T19:43:00.1631286Z apicid : 93 2025-05-07T19:43:00.1631364Z initial apicid : 93 2025-05-07T19:43:00.1631435Z fpu : yes 2025-05-07T19:43:00.1631518Z fpu_exception : yes 2025-05-07T19:43:00.1631602Z cpuid level : 13 2025-05-07T19:43:00.1631684Z wp : yes 2025-05-07T19:43:00.1634050Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1634557Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1634644Z bogomips : 5999.98 2025-05-07T19:43:00.1634730Z clflush size : 64 2025-05-07T19:43:00.1634824Z cache_alignment : 64 2025-05-07T19:43:00.1634954Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1635039Z power management: 2025-05-07T19:43:00.1635044Z 2025-05-07T19:43:00.1635144Z processor : 87 2025-05-07T19:43:00.1635236Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1635313Z cpu family : 6 2025-05-07T19:43:00.1635398Z model : 85 2025-05-07T19:43:00.1635576Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1635705Z stepping : 7 2025-05-07T19:43:00.1635789Z microcode : 0x5003901 2025-05-07T19:43:00.1635881Z cpu MHz : 2999.994 2025-05-07T19:43:00.1635963Z cache size : 36608 KB 2025-05-07T19:43:00.1636043Z physical id : 1 2025-05-07T19:43:00.1636121Z siblings : 48 2025-05-07T19:43:00.1636204Z core id : 15 2025-05-07T19:43:00.1636281Z cpu cores : 24 2025-05-07T19:43:00.1636360Z apicid : 95 2025-05-07T19:43:00.1636456Z initial apicid : 95 2025-05-07T19:43:00.1636531Z fpu : yes 2025-05-07T19:43:00.1636613Z fpu_exception : yes 2025-05-07T19:43:00.1636697Z cpuid level : 13 2025-05-07T19:43:00.1636776Z wp : yes 2025-05-07T19:43:00.1639012Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1639409Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1639509Z bogomips : 5999.98 2025-05-07T19:43:00.1639590Z clflush size : 64 2025-05-07T19:43:00.1639673Z cache_alignment : 64 2025-05-07T19:43:00.1639818Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1639903Z power management: 2025-05-07T19:43:00.1639907Z 2025-05-07T19:43:00.1639983Z processor : 88 2025-05-07T19:43:00.1640084Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1640168Z cpu family : 6 2025-05-07T19:43:00.1640238Z model : 85 2025-05-07T19:43:00.1640394Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1640478Z stepping : 7 2025-05-07T19:43:00.1640563Z microcode : 0x5003901 2025-05-07T19:43:00.1640641Z cpu MHz : 2999.994 2025-05-07T19:43:00.1640741Z cache size : 36608 KB 2025-05-07T19:43:00.1640824Z physical id : 1 2025-05-07T19:43:00.1640904Z siblings : 48 2025-05-07T19:43:00.1640980Z core id : 16 2025-05-07T19:43:00.1641071Z cpu cores : 24 2025-05-07T19:43:00.1641147Z apicid : 97 2025-05-07T19:43:00.1641227Z initial apicid : 97 2025-05-07T19:43:00.1641302Z fpu : yes 2025-05-07T19:43:00.1641397Z fpu_exception : yes 2025-05-07T19:43:00.1641478Z cpuid level : 13 2025-05-07T19:43:00.1641553Z wp : yes 2025-05-07T19:43:00.1643793Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1644238Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1644320Z bogomips : 5999.98 2025-05-07T19:43:00.1644408Z clflush size : 64 2025-05-07T19:43:00.1644496Z cache_alignment : 64 2025-05-07T19:43:00.1644626Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1644718Z power management: 2025-05-07T19:43:00.1644723Z 2025-05-07T19:43:00.1644807Z processor : 89 2025-05-07T19:43:00.1644895Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1644983Z cpu family : 6 2025-05-07T19:43:00.1645058Z model : 85 2025-05-07T19:43:00.1645321Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1645398Z stepping : 7 2025-05-07T19:43:00.1645484Z microcode : 0x5003901 2025-05-07T19:43:00.1645610Z cpu MHz : 2999.994 2025-05-07T19:43:00.1645691Z cache size : 36608 KB 2025-05-07T19:43:00.1645776Z physical id : 1 2025-05-07T19:43:00.1645847Z siblings : 48 2025-05-07T19:43:00.1645919Z core id : 17 2025-05-07T19:43:00.1645990Z cpu cores : 24 2025-05-07T19:43:00.1646077Z apicid : 99 2025-05-07T19:43:00.1646154Z initial apicid : 99 2025-05-07T19:43:00.1646224Z fpu : yes 2025-05-07T19:43:00.1646300Z fpu_exception : yes 2025-05-07T19:43:00.1646389Z cpuid level : 13 2025-05-07T19:43:00.1646458Z wp : yes 2025-05-07T19:43:00.1648511Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1648878Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1648956Z bogomips : 5999.98 2025-05-07T19:43:00.1649035Z clflush size : 64 2025-05-07T19:43:00.1649121Z cache_alignment : 64 2025-05-07T19:43:00.1649241Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1649321Z power management: 2025-05-07T19:43:00.1649325Z 2025-05-07T19:43:00.1649421Z processor : 90 2025-05-07T19:43:00.1649502Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1649577Z cpu family : 6 2025-05-07T19:43:00.1649650Z model : 85 2025-05-07T19:43:00.1649804Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1649876Z stepping : 7 2025-05-07T19:43:00.1649956Z microcode : 0x5003901 2025-05-07T19:43:00.1650035Z cpu MHz : 2999.994 2025-05-07T19:43:00.1650111Z cache size : 36608 KB 2025-05-07T19:43:00.1650192Z physical id : 1 2025-05-07T19:43:00.1650268Z siblings : 48 2025-05-07T19:43:00.1650345Z core id : 18 2025-05-07T19:43:00.1650420Z cpu cores : 24 2025-05-07T19:43:00.1650494Z apicid : 101 2025-05-07T19:43:00.1650580Z initial apicid : 101 2025-05-07T19:43:00.1650650Z fpu : yes 2025-05-07T19:43:00.1650736Z fpu_exception : yes 2025-05-07T19:43:00.1650812Z cpuid level : 13 2025-05-07T19:43:00.1650893Z wp : yes 2025-05-07T19:43:00.1652940Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1653367Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1653441Z bogomips : 5999.98 2025-05-07T19:43:00.1653517Z clflush size : 64 2025-05-07T19:43:00.1653598Z cache_alignment : 64 2025-05-07T19:43:00.1653733Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1653805Z power management: 2025-05-07T19:43:00.1653809Z 2025-05-07T19:43:00.1653880Z processor : 91 2025-05-07T19:43:00.1653969Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1654039Z cpu family : 6 2025-05-07T19:43:00.1654109Z model : 85 2025-05-07T19:43:00.1654253Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1654335Z stepping : 7 2025-05-07T19:43:00.1654409Z microcode : 0x5003901 2025-05-07T19:43:00.1654479Z cpu MHz : 2999.994 2025-05-07T19:43:00.1654561Z cache size : 36608 KB 2025-05-07T19:43:00.1654676Z physical id : 1 2025-05-07T19:43:00.1654749Z siblings : 48 2025-05-07T19:43:00.1654825Z core id : 19 2025-05-07T19:43:00.1654910Z cpu cores : 24 2025-05-07T19:43:00.1654982Z apicid : 103 2025-05-07T19:43:00.1655060Z initial apicid : 103 2025-05-07T19:43:00.1655146Z fpu : yes 2025-05-07T19:43:00.1655232Z fpu_exception : yes 2025-05-07T19:43:00.1655307Z cpuid level : 13 2025-05-07T19:43:00.1655374Z wp : yes 2025-05-07T19:43:00.1657437Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1657800Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1657885Z bogomips : 5999.98 2025-05-07T19:43:00.1657959Z clflush size : 64 2025-05-07T19:43:00.1658042Z cache_alignment : 64 2025-05-07T19:43:00.1658163Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1658249Z power management: 2025-05-07T19:43:00.1658253Z 2025-05-07T19:43:00.1658327Z processor : 92 2025-05-07T19:43:00.1658411Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1658491Z cpu family : 6 2025-05-07T19:43:00.1658560Z model : 85 2025-05-07T19:43:00.1658710Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1658784Z stepping : 7 2025-05-07T19:43:00.1658887Z microcode : 0x5003901 2025-05-07T19:43:00.1658962Z cpu MHz : 2999.994 2025-05-07T19:43:00.1659041Z cache size : 36608 KB 2025-05-07T19:43:00.1659129Z physical id : 1 2025-05-07T19:43:00.1659206Z siblings : 48 2025-05-07T19:43:00.1659275Z core id : 20 2025-05-07T19:43:00.1659350Z cpu cores : 24 2025-05-07T19:43:00.1659444Z apicid : 105 2025-05-07T19:43:00.1659523Z initial apicid : 105 2025-05-07T19:43:00.1659592Z fpu : yes 2025-05-07T19:43:00.1659687Z fpu_exception : yes 2025-05-07T19:43:00.1659762Z cpuid level : 13 2025-05-07T19:43:00.1659832Z wp : yes 2025-05-07T19:43:00.1661895Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1662261Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1662394Z bogomips : 5999.98 2025-05-07T19:43:00.1662483Z clflush size : 64 2025-05-07T19:43:00.1662561Z cache_alignment : 64 2025-05-07T19:43:00.1662679Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1662758Z power management: 2025-05-07T19:43:00.1662762Z 2025-05-07T19:43:00.1662852Z processor : 93 2025-05-07T19:43:00.1662932Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1663006Z cpu family : 6 2025-05-07T19:43:00.1663091Z model : 85 2025-05-07T19:43:00.1663240Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1663313Z stepping : 7 2025-05-07T19:43:00.1663391Z microcode : 0x5003901 2025-05-07T19:43:00.1663487Z cpu MHz : 2999.994 2025-05-07T19:43:00.1663567Z cache size : 36608 KB 2025-05-07T19:43:00.1663643Z physical id : 1 2025-05-07T19:43:00.1663734Z siblings : 48 2025-05-07T19:43:00.1663808Z core id : 21 2025-05-07T19:43:00.1663930Z cpu cores : 24 2025-05-07T19:43:00.1664007Z apicid : 107 2025-05-07T19:43:00.1664099Z initial apicid : 107 2025-05-07T19:43:00.1664169Z fpu : yes 2025-05-07T19:43:00.1664252Z fpu_exception : yes 2025-05-07T19:43:00.1664344Z cpuid level : 13 2025-05-07T19:43:00.1664417Z wp : yes 2025-05-07T19:43:00.1666471Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1666850Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1666934Z bogomips : 5999.98 2025-05-07T19:43:00.1667008Z clflush size : 64 2025-05-07T19:43:00.1667098Z cache_alignment : 64 2025-05-07T19:43:00.1667216Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1667291Z power management: 2025-05-07T19:43:00.1667295Z 2025-05-07T19:43:00.1667369Z processor : 94 2025-05-07T19:43:00.1667460Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1667532Z cpu family : 6 2025-05-07T19:43:00.1667600Z model : 85 2025-05-07T19:43:00.1667768Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1667842Z stepping : 7 2025-05-07T19:43:00.1667917Z microcode : 0x5003901 2025-05-07T19:43:00.1667995Z cpu MHz : 1779.245 2025-05-07T19:43:00.1668083Z cache size : 36608 KB 2025-05-07T19:43:00.1668160Z physical id : 1 2025-05-07T19:43:00.1668236Z siblings : 48 2025-05-07T19:43:00.1668328Z core id : 22 2025-05-07T19:43:00.1668400Z cpu cores : 24 2025-05-07T19:43:00.1668474Z apicid : 109 2025-05-07T19:43:00.1668555Z initial apicid : 109 2025-05-07T19:43:00.1668637Z fpu : yes 2025-05-07T19:43:00.1668716Z fpu_exception : yes 2025-05-07T19:43:00.1668788Z cpuid level : 13 2025-05-07T19:43:00.1668861Z wp : yes 2025-05-07T19:43:00.1670934Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1671305Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1671395Z bogomips : 5999.98 2025-05-07T19:43:00.1671519Z clflush size : 64 2025-05-07T19:43:00.1671596Z cache_alignment : 64 2025-05-07T19:43:00.1671729Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1671804Z power management: 2025-05-07T19:43:00.1671809Z 2025-05-07T19:43:00.1671888Z processor : 95 2025-05-07T19:43:00.1671972Z vendor_id : GenuineIntel 2025-05-07T19:43:00.1672058Z cpu family : 6 2025-05-07T19:43:00.1672132Z model : 85 2025-05-07T19:43:00.1672283Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:00.1672362Z stepping : 7 2025-05-07T19:43:00.1672436Z microcode : 0x5003901 2025-05-07T19:43:00.1672579Z cpu MHz : 2999.994 2025-05-07T19:43:00.1672663Z cache size : 36608 KB 2025-05-07T19:43:00.1672750Z physical id : 1 2025-05-07T19:43:00.1672827Z siblings : 48 2025-05-07T19:43:00.1673070Z core id : 23 2025-05-07T19:43:00.1673173Z cpu cores : 24 2025-05-07T19:43:00.1673250Z apicid : 111 2025-05-07T19:43:00.1673401Z initial apicid : 111 2025-05-07T19:43:00.1673483Z fpu : yes 2025-05-07T19:43:00.1673584Z fpu_exception : yes 2025-05-07T19:43:00.1673662Z cpuid level : 13 2025-05-07T19:43:00.1673738Z wp : yes 2025-05-07T19:43:00.1675973Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:00.1676365Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:00.1676451Z bogomips : 5999.98 2025-05-07T19:43:00.1676545Z clflush size : 64 2025-05-07T19:43:00.1676631Z cache_alignment : 64 2025-05-07T19:43:00.1676760Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:00.1676842Z power management: 2025-05-07T19:43:00.1676859Z 2025-05-07T19:43:00.1676863Z 2025-05-07T19:43:00.1676976Z ################################################################################ 2025-05-07T19:43:00.1677074Z [INFO] Print PCI info ... 2025-05-07T19:43:00.1677160Z + lspci -v 2025-05-07T19:43:00.1677179Z 2025-05-07T19:43:00.1677370Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:00.1677484Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:00.1677603Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:00.1677607Z 2025-05-07T19:43:00.1677825Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:00.1677915Z Physical Slot: 1 2025-05-07T19:43:00.1678025Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:00.1678030Z 2025-05-07T19:43:00.1678301Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:00.1678398Z Physical Slot: 1 2025-05-07T19:43:00.1678528Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:00.1678533Z 2025-05-07T19:43:00.1678821Z 00:03.0 VGA compatible controller: Amazon.com, Inc. 
Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:00.1678909Z Physical Slot: 3 2025-05-07T19:43:00.1679020Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:00.1679156Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:00.1679294Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:00.1679298Z 2025-05-07T19:43:00.1679620Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:00.1679725Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:00.1679817Z Physical Slot: 4 2025-05-07T19:43:00.1679948Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:00.1680156Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:00.1680277Z Capabilities: 2025-05-07T19:43:00.1680367Z Kernel driver in use: nvme 2025-05-07T19:43:00.1680371Z 2025-05-07T19:43:00.1680589Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:00.1680688Z Physical Slot: 5 2025-05-07T19:43:00.1680804Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:00.1680956Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:00.1681091Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:00.1681253Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:00.1681354Z Capabilities: 2025-05-07T19:43:00.1681445Z Kernel driver in use: ena 2025-05-07T19:43:00.1681449Z 2025-05-07T19:43:00.1681453Z 2025-05-07T19:43:00.1681652Z ################################################################################ 2025-05-07T19:43:00.1681763Z [INFO] Print Linux distribution info ... 
2025-05-07T19:43:00.1681851Z + uname -a 2025-05-07T19:43:00.1681856Z 2025-05-07T19:43:00.1682271Z Linux f3f10d3a0ffb 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:00.1682277Z 2025-05-07T19:43:00.1682362Z + uname -m 2025-05-07T19:43:00.1682367Z 2025-05-07T19:43:00.1682440Z x86_64 2025-05-07T19:43:00.1682444Z 2025-05-07T19:43:00.1682539Z + cat /proc/version 2025-05-07T19:43:00.1682543Z 2025-05-07T19:43:00.1683155Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:00.1683160Z 2025-05-07T19:43:00.1683249Z + cat /etc/os-release 2025-05-07T19:43:00.1683253Z 2025-05-07T19:43:00.1683344Z NAME="Amazon Linux" 2025-05-07T19:43:00.1683419Z VERSION="2023" 2025-05-07T19:43:00.1683494Z ID="amzn" 2025-05-07T19:43:00.1683572Z ID_LIKE="fedora" 2025-05-07T19:43:00.1683676Z VERSION_ID="2023" 2025-05-07T19:43:00.1683778Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:00.1683886Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:00.1683971Z ANSI_COLOR="0;33" 2025-05-07T19:43:00.1684095Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:00.1684285Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:00.1684457Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:00.1684622Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:00.1684819Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:00.1684900Z VENDOR_NAME="AWS" 2025-05-07T19:43:00.1685023Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:00.1685109Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:00.1685114Z 2025-05-07T19:43:00.1726293Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:00.1726460Z . 
$PRELUDE; print_gpu_info 2025-05-07T19:43:00.1726803Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:00.1726889Z env: 2025-05-07T19:43:00.1727008Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:00.1727096Z BUILD_ENV: build_binary 2025-05-07T19:43:00.1727199Z BUILD_TARGET: default 2025-05-07T19:43:00.1727278Z BUILD_VARIANT: cuda 2025-05-07T19:43:00.1727364Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:00.1727448Z ##[endgroup] 2025-05-07T19:43:00.5778662Z ################################################################################ 2025-05-07T19:43:00.5780483Z [INFO] Printing general display info ... 2025-05-07T19:43:00.5790960Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:00.6681686Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:00.6687566Z /usr/bin/sudo 2025-05-07T19:43:00.6696322Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:00.6703175Z /usr/bin/yum 2025-05-07T19:43:00.6703801Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:00.6724600Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:00.8886216Z Last metadata expiration check: 0:00:17 ago on Wed May 7 19:42:43 2025. 2025-05-07T19:43:00.9833069Z Dependencies resolved. 2025-05-07T19:43:01.0044282Z Nothing to do. 2025-05-07T19:43:01.0045170Z Complete! 2025-05-07T19:43:01.0684545Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:01.0707487Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:01.2805045Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:43 2025. 2025-05-07T19:43:01.3316423Z Dependencies resolved. 
2025-05-07T19:43:01.3479120Z ================================================================================ 2025-05-07T19:43:01.3480231Z Package Arch Version Repository Size 2025-05-07T19:43:01.3481040Z ================================================================================ 2025-05-07T19:43:01.3481455Z Installing: 2025-05-07T19:43:01.3482215Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:01.3482738Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:01.3483041Z 2025-05-07T19:43:01.3483138Z Transaction Summary 2025-05-07T19:43:01.3483389Z ================================================================================ 2025-05-07T19:43:01.3483719Z Install 2 Packages 2025-05-07T19:43:01.3483858Z 2025-05-07T19:43:01.3483982Z Total download size: 347 k 2025-05-07T19:43:01.3484250Z Installed size: 883 k 2025-05-07T19:43:01.3484512Z Downloading Packages: 2025-05-07T19:43:01.6392727Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.1 MB/s | 28 kB 00:00 2025-05-07T19:43:01.6426841Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 11 MB/s | 319 kB 00:00 2025-05-07T19:43:01.6436569Z -------------------------------------------------------------------------------- 2025-05-07T19:43:01.6437423Z Total 1.1 MB/s | 347 kB 00:00 2025-05-07T19:43:01.6651253Z Running transaction check 2025-05-07T19:43:01.6705244Z Transaction check succeeded. 2025-05-07T19:43:01.6705926Z Running transaction test 2025-05-07T19:43:01.6861119Z Transaction test succeeded. 
2025-05-07T19:43:01.6862662Z Running transaction 2025-05-07T19:43:01.7134389Z Preparing : 1/1 2025-05-07T19:43:01.7213560Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:01.7245319Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:02.7716357Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:02.7719168Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:02.8080438Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:02.8082124Z 2025-05-07T19:43:02.8082407Z Installed: 2025-05-07T19:43:02.8083431Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:02.8084429Z 2025-05-07T19:43:02.8084665Z Complete! 2025-05-07T19:43:02.8434757Z + hostname 2025-05-07T19:43:02.8435467Z 2025-05-07T19:43:02.8439647Z f3f10d3a0ffb 2025-05-07T19:43:02.8439970Z 2025-05-07T19:43:02.8440606Z + sudo lshw -C display 2025-05-07T19:43:02.8440955Z 2025-05-07T19:43:03.0393292Z *-display UNCLAIMED 2025-05-07T19:43:03.0394799Z description: VGA compatible controller 2025-05-07T19:43:03.0396265Z product: Amazon.com, Inc. 2025-05-07T19:43:03.0397114Z vendor: Amazon.com, Inc. 2025-05-07T19:43:03.0397564Z physical id: 3 2025-05-07T19:43:03.0397830Z bus info: pci@0000:00:03.0 2025-05-07T19:43:03.0398091Z version: 00 2025-05-07T19:43:03.0398334Z width: 32 bits 2025-05-07T19:43:03.0398551Z clock: 33MHz 2025-05-07T19:43:03.0398807Z capabilities: vga_controller bus_master 2025-05-07T19:43:03.0399379Z configuration: latency=0 2025-05-07T19:43:03.0399699Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:03.0414493Z 2025-05-07T19:43:03.0415411Z ################################################################################ 2025-05-07T19:43:03.0416915Z [INFO] Printing NVIDIA GPU info ... 
2025-05-07T19:43:03.0520692Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:03.0546481Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:03.0547221Z [CHECK] nvidia-smi not found 2025-05-07T19:43:03.0547540Z ################################################################################ 2025-05-07T19:43:03.0547875Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:03.0654141Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:03.0676919Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:03.0677469Z [CHECK] rocminfo not found 2025-05-07T19:43:03.0681977Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:03.0684160Z [CHECK] rocm-smi not found 2025-05-07T19:43:03.0751413Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:03.0751946Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:03.0752704Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:03.0753257Z env: 2025-05-07T19:43:03.0753516Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:03.0753969Z BUILD_ENV: build_binary 2025-05-07T19:43:03.0754243Z BUILD_TARGET: default 2025-05-07T19:43:03.0754530Z BUILD_VARIANT: cuda 2025-05-07T19:43:03.0754821Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:03.0755094Z ##[endgroup] 2025-05-07T19:43:03.5356444Z ################################################################################ 2025-05-07T19:43:03.5356910Z # Setup Miniconda 2025-05-07T19:43:03.5357193Z # 2025-05-07T19:43:03.5380505Z # [2025-05-07T19:43:03.537Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:03.5381078Z ################################################################################ 2025-05-07T19:43:03.5381500Z 2025-05-07T19:43:03.5413072Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:03.6234643Z [CHECK] 
Network does not appear to be blocked. 2025-05-07T19:43:03.6235056Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:03.6235291Z 2025-05-07T19:43:03.6253381Z 2025-05-07T19:43:03.6253794Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:03.6275822Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:05.0099126Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:05.0099727Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:05.0100018Z 2025-05-07T19:43:05.0239324Z PREFIX=/github/home/miniconda 2025-05-07T19:43:05.3806838Z Unpacking payload ... 2025-05-07T19:43:05.8572445Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:06.5124476Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:08.3498790Z 2025-05-07T19:43:08.3499352Z Installing base environment... 2025-05-07T19:43:08.3499601Z 2025-05-07T19:43:09.3320657Z Preparing transaction: ...working... done 2025-05-07T19:43:12.1634998Z Executing transaction: ...working... done 2025-05-07T19:43:12.7069734Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:12.7746126Z installation finished. 2025-05-07T19:43:12.7746798Z 2025-05-07T19:43:12.7747336Z + rm -f miniconda.sh 2025-05-07T19:43:12.7747847Z 2025-05-07T19:43:12.7914462Z 2025-05-07T19:43:12.7915360Z [SETUP] Reloading the bash configuration ... 
2025-05-07T19:43:12.7915935Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:12.7916191Z 2025-05-07T19:43:13.1481872Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:13.1483112Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:13.1484203Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:13.1485277Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:13.1486774Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:13.1487688Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:13.1488177Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:13.1488658Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:13.1489157Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:13.1489773Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:13.1490671Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:13.1491094Z modified /github/home/.bashrc 2025-05-07T19:43:13.1491298Z 2025-05-07T19:43:13.1491535Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:13.1491884Z 2025-05-07T19:43:13.2001979Z 2025-05-07T19:43:13.2002681Z + . /github/home/.bashrc 2025-05-07T19:43:13.2003206Z 2025-05-07T19:43:13.9716195Z 2025-05-07T19:43:13.9716917Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 
2025-05-07T19:43:13.9751783Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:25.6271815Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:27.0818937Z Solving environment: \ | / - \ | / - \ | / done 2025-05-07T19:43:27.1712765Z 2025-05-07T19:43:27.1713683Z ## Package Plan ## 2025-05-07T19:43:27.1714192Z 2025-05-07T19:43:27.1714589Z environment location: /github/home/miniconda 2025-05-07T19:43:27.1715327Z 2025-05-07T19:43:27.1715626Z added / updated specs: 2025-05-07T19:43:27.1716131Z - conda-libmamba-solver 2025-05-07T19:43:27.1716413Z - libarchive 2025-05-07T19:43:27.1716631Z - libmamba 2025-05-07T19:43:27.1716858Z - libmambapy 2025-05-07T19:43:27.1716990Z 2025-05-07T19:43:27.1716996Z 2025-05-07T19:43:27.1717122Z The following packages will be downloaded: 2025-05-07T19:43:27.1717365Z 2025-05-07T19:43:27.1717545Z package | build 2025-05-07T19:43:27.1717900Z ---------------------------|----------------- 2025-05-07T19:43:27.1718358Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:27.1718891Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:27.1719346Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:27.1719862Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:27.1720358Z ------------------------------------------------------------ 2025-05-07T19:43:27.1720713Z Total: 1.4 MB 2025-05-07T19:43:27.1720936Z 2025-05-07T19:43:27.1721071Z The following packages will be UPDATED: 2025-05-07T19:43:27.1721286Z 2025-05-07T19:43:27.1725740Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 
2025-05-07T19:43:27.1726999Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:27.1727429Z 2025-05-07T19:43:27.1727685Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:27.1728028Z 2025-05-07T19:43:27.1728374Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:27.1729251Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:27.1729777Z 2025-05-07T19:43:27.1729781Z 2025-05-07T19:43:27.1729785Z 2025-05-07T19:43:27.1729950Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:27.1730336Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:27.1730594Z 2025-05-07T19:43:27.1731088Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:27.1731347Z 2025-05-07T19:43:27.1731356Z 2025-05-07T19:43:27.1733821Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:27.1734099Z 2025-05-07T19:43:27.1734103Z 2025-05-07T19:43:27.1734313Z 2025-05-07T19:43:27.2258898Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:27.2259500Z 2025-05-07T19:43:27.2260617Z 2025-05-07T19:43:27.2339212Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:27.2339830Z 2025-05-07T19:43:27.2339843Z 2025-05-07T19:43:27.2410057Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:27.2410492Z 2025-05-07T19:43:27.2410711Z 2025-05-07T19:43:27.2410715Z 2025-05-07T19:43:27.2431871Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:27.2432203Z 2025-05-07T19:43:27.2660395Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:27.2660678Z 2025-05-07T19:43:27.2660683Z 2025-05-07T19:43:27.2660687Z 2025-05-07T19:43:27.2688162Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:27.2688525Z 2025-05-07T19:43:27.2699672Z certifi-2025.4.26 | 154 KB | ########## | 100%  
2025-05-07T19:43:27.3638205Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:27.3638660Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:27.3646915Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:27.3648013Z 2025-05-07T19:43:27.3648222Z 2025-05-07T19:43:27.3648467Z  2025-05-07T19:43:27.3648699Z 2025-05-07T19:43:27.3648703Z 2025-05-07T19:43:27.3651939Z  2025-05-07T19:43:27.3652162Z 2025-05-07T19:43:27.3652174Z 2025-05-07T19:43:27.3652193Z 2025-05-07T19:43:27.3652381Z  done 2025-05-07T19:43:27.4659834Z Preparing transaction: \ done 2025-05-07T19:43:27.5669505Z Verifying transaction: / done 2025-05-07T19:43:28.8702016Z Executing transaction: \ | / - \ | / - \ | / - \ done 2025-05-07T19:43:30.4317768Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:30.4353585Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:31.1793603Z Channels: 2025-05-07T19:43:31.1794243Z - defaults 2025-05-07T19:43:31.1794870Z Platform: linux-64 2025-05-07T19:43:32.2296394Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:32.3618669Z Solving environment: / - Channels: 2025-05-07T19:43:32.3619594Z - defaults 2025-05-07T19:43:32.3620220Z Platform: linux-64 2025-05-07T19:43:32.6385432Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:32.8466124Z Solving environment: / - \ done 2025-05-07T19:43:32.9673307Z | done 2025-05-07T19:43:33.0305600Z 2025-05-07T19:43:33.0306091Z ## Package Plan ## 2025-05-07T19:43:33.0306593Z 2025-05-07T19:43:33.0306738Z environment location: /github/home/miniconda 2025-05-07T19:43:33.0307011Z 2025-05-07T19:43:33.0307131Z added / updated specs: 2025-05-07T19:43:33.0307410Z - conda 2025-05-07T19:43:33.0307533Z 2025-05-07T19:43:33.0307537Z 2025-05-07T19:43:33.0307684Z The following packages will be downloaded: 2025-05-07T19:43:33.0307915Z 2025-05-07T19:43:33.0308034Z package | build 2025-05-07T19:43:33.0308389Z 
---------------------------|----------------- 2025-05-07T19:43:33.0308862Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:33.0309282Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:33.0309681Z ------------------------------------------------------------ 2025-05-07T19:43:33.0310027Z Total: 1.4 MB 2025-05-07T19:43:33.0310244Z 2025-05-07T19:43:33.0310377Z The following packages will be UPDATED: 2025-05-07T19:43:33.0310599Z 2025-05-07T19:43:33.0310940Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:33.0311692Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:33.0311957Z 2025-05-07T19:43:33.0311961Z 2025-05-07T19:43:33.0311965Z 2025-05-07T19:43:33.0312131Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:33.0312623Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:33.0312879Z 2025-05-07T19:43:33.0725304Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:33.0725596Z 2025-05-07T19:43:33.0861060Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:33.2684380Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:33.2685177Z 2025-05-07T19:43:33.2686232Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:33.2686493Z 2025-05-07T19:43:33.2734570Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:33.2735010Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:33.2735579Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:33.2735914Z 2025-05-07T19:43:33.2736152Z 2025-05-07T19:43:33.2736342Z  done 2025-05-07T19:43:33.3745834Z Preparing transaction: - done 2025-05-07T19:43:33.4754658Z Verifying transaction: | done 2025-05-07T19:43:35.3795217Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:43:35.9236002Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:35.9240170Z + conda clean --packages --tarball -y 2025-05-07T19:43:35.9240393Z 2025-05-07T19:43:36.3609313Z Will remove 99 (117.8 MB) tarball(s). 
2025-05-07T19:43:36.3609799Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:36.4161902Z 2025-05-07T19:43:36.4172182Z + conda clean --all -y 2025-05-07T19:43:36.4172827Z 2025-05-07T19:43:36.8640553Z There are no unused tarball(s) to remove. 2025-05-07T19:43:36.8641595Z Will remove 1 index cache(s). 2025-05-07T19:43:36.8642479Z There are no unused package(s) to remove. 2025-05-07T19:43:36.8643431Z There are no tempfile(s) to remove. 2025-05-07T19:43:36.8644308Z There are no logfile(s) to remove. 2025-05-07T19:43:36.9170605Z 2025-05-07T19:43:36.9171134Z + conda info 2025-05-07T19:43:36.9171549Z 2025-05-07T19:43:37.4734588Z 2025-05-07T19:43:37.4735270Z active environment : base 2025-05-07T19:43:37.4736217Z active env location : /github/home/miniconda 2025-05-07T19:43:37.4737190Z shell level : 1 2025-05-07T19:43:37.4737999Z user config file : /github/home/.condarc 2025-05-07T19:43:37.4739157Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:37.4740262Z conda version : 25.3.1 2025-05-07T19:43:37.4741116Z conda-build version : not installed 2025-05-07T19:43:37.4741852Z python version : 3.13.2.final.0 2025-05-07T19:43:37.4742605Z solver : libmamba (default) 2025-05-07T19:43:37.4742971Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:37.4743299Z __conda=25.3.1=0 2025-05-07T19:43:37.4743610Z __glibc=2.34=0 2025-05-07T19:43:37.4743919Z __linux=6.1.130=0 2025-05-07T19:43:37.4744200Z __unix=0=0 2025-05-07T19:43:37.4744559Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:37.4744975Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:37.4745347Z conda av metadata url : None 2025-05-07T19:43:37.4745727Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:37.4746192Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:37.4746588Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:37.4747109Z https://repo.anaconda.com/pkgs/r/noarch 
2025-05-07T19:43:37.4747485Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:37.4747978Z /github/home/.conda/pkgs 2025-05-07T19:43:37.4748340Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:37.4748668Z /github/home/.conda/envs 2025-05-07T19:43:37.4748984Z platform : linux-64 2025-05-07T19:43:37.4749842Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:37.4750729Z UID:GID : 0:0 2025-05-07T19:43:37.4751001Z netrc file : None 2025-05-07T19:43:37.4751257Z offline mode : False 2025-05-07T19:43:37.4751424Z 2025-05-07T19:43:37.5321112Z 2025-05-07T19:43:37.5321455Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:37.5322153Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_fd573ec3-8ac2-4670-a25f-fbafbb18ca6d ... 2025-05-07T19:43:37.5322905Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:37.5459581Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.11 2025-05-07T19:43:37.5460179Z . 
$PRELUDE; create_conda_environment $BUILD_ENV 3.11 2025-05-07T19:43:37.5460994Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:37.5461372Z env: 2025-05-07T19:43:37.5461629Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:37.5461977Z BUILD_ENV: build_binary 2025-05-07T19:43:37.5462268Z BUILD_TARGET: default 2025-05-07T19:43:37.5462512Z BUILD_VARIANT: cuda 2025-05-07T19:43:37.5462786Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:37.5463047Z ##[endgroup] 2025-05-07T19:43:38.0090050Z ################################################################################ 2025-05-07T19:43:38.0090469Z # Create Conda Environment 2025-05-07T19:43:38.0090759Z # 2025-05-07T19:43:38.0106460Z # [2025-05-07T19:43:38.010Z] + create_conda_environment build_binary 3.11 2025-05-07T19:43:38.0106997Z ################################################################################ 2025-05-07T19:43:38.0107231Z 2025-05-07T19:43:38.0119790Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:38.1015616Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:38.1016781Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:38.1017764Z + conda info --envs 2025-05-07T19:43:38.1018173Z 2025-05-07T19:43:38.6724518Z 2025-05-07T19:43:38.6725550Z # conda environments: 2025-05-07T19:43:38.6726274Z # 2025-05-07T19:43:38.6726917Z base /github/home/miniconda 2025-05-07T19:43:38.6727579Z 2025-05-07T19:43:38.7308108Z 2025-05-07T19:43:38.7308522Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:40.3546961Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:40.3548336Z 2025-05-07T19:43:40.3559180Z 2025-05-07T19:43:40.3565725Z [SETUP] Creating new Conda environment (Python 3.11) ... 
2025-05-07T19:43:40.3589263Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.11 2025-05-07T19:43:40.9306921Z Channels: 2025-05-07T19:43:40.9307196Z - defaults 2025-05-07T19:43:40.9307552Z Platform: linux-64 2025-05-07T19:43:42.2982283Z Collecting package metadata (repodata.json): - \ | / - \ | / done 2025-05-07T19:43:42.3988881Z Solving environment: \ done 2025-05-07T19:43:42.4276627Z 2025-05-07T19:43:42.4277102Z ## Package Plan ## 2025-05-07T19:43:42.4277800Z 2025-05-07T19:43:42.4278402Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:42.4279439Z 2025-05-07T19:43:42.4279717Z added / updated specs: 2025-05-07T19:43:42.4280453Z - python=3.11 2025-05-07T19:43:42.4280840Z 2025-05-07T19:43:42.4280852Z 2025-05-07T19:43:42.4281204Z The following packages will be downloaded: 2025-05-07T19:43:42.4281902Z 2025-05-07T19:43:42.4282270Z package | build 2025-05-07T19:43:42.4283216Z ---------------------------|----------------- 2025-05-07T19:43:42.4284331Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:42.4285535Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:42.4287234Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:42.4288486Z python-3.11.11 | he870216_0 32.9 MB 2025-05-07T19:43:42.4289677Z setuptools-78.1.1 | py311h06a4308_0 2.3 MB 2025-05-07T19:43:42.4290365Z wheel-0.45.1 | py311h06a4308_0 151 KB 2025-05-07T19:43:42.4290755Z ------------------------------------------------------------ 2025-05-07T19:43:42.4291127Z Total: 35.4 MB 2025-05-07T19:43:42.4291351Z 2025-05-07T19:43:42.4291488Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:42.4291755Z 2025-05-07T19:43:42.4291967Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:42.4292450Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:42.4293240Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:42.4293784Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 
2025-05-07T19:43:42.4294391Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:42.4294905Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:42.4295358Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:42.4295841Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:42.4296330Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:42.4296839Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:42.4297425Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:42.4297865Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:42.4298310Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:42.4298732Z python pkgs/main/linux-64::python-3.11.11-he870216_0 2025-05-07T19:43:42.4299191Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:42.4299685Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py311h06a4308_0 2025-05-07T19:43:42.4300190Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:42.4300616Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:42.4301011Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:42.4301466Z wheel pkgs/main/linux-64::wheel-0.45.1-py311h06a4308_0 2025-05-07T19:43:42.4301876Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:42.4303660Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:42.4303917Z 2025-05-07T19:43:42.4303922Z 2025-05-07T19:43:42.4303926Z 2025-05-07T19:43:42.4304102Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:43:42.4665214Z wheel-0.45.1 | 151 KB | ########## | 100% 2025-05-07T19:43:42.4764724Z _openmp_mutex-5.1 | 21 KB | ########## | 100% 2025-05-07T19:43:42.4886362Z _libgcc_mutex-0.1 | 3 KB | ########## | 100% 2025-05-07T19:43:42.5234982Z ca-certificates-2025 | 129 KB | ########## | 100% 2025-05-07T19:43:42.5278581Z setuptools-78.1.1 | 2.3 MB | ########## | 100% 2025-05-07T19:43:43.3476076Z python-3.11.11 | 32.9 MB | ########## | 100% 2025-05-07T19:43:43.3481689Z done 2025-05-07T19:43:43.5541713Z Preparing transaction: / - done 2025-05-07T19:43:44.9341219Z Verifying transaction: | / - \ | / - \ | / - \ | done 2025-05-07T19:43:47.0492382Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:47.0534674Z # 2025-05-07T19:43:47.0535389Z # To activate this environment, use
2025-05-07T19:43:47.0536310Z # 2025-05-07T19:43:47.0536870Z # $ conda activate build_binary 2025-05-07T19:43:47.0537475Z # 2025-05-07T19:43:47.0537702Z # To deactivate an active environment, use 2025-05-07T19:43:47.0538033Z # 2025-05-07T19:43:47.0538230Z # $ conda deactivate 2025-05-07T19:43:47.0538415Z 2025-05-07T19:43:47.1338749Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:47.1373763Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:49.9973982Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:49.9975959Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (25.1) 2025-05-07T19:43:49.9976557Z 2025-05-07T19:43:49.9976651Z Collecting pip 2025-05-07T19:43:49.9976971Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:49.9977671Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:49.9978466Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 68.0 MB/s eta 0:00:00 2025-05-07T19:43:49.9978835Z Installing collected packages: pip 2025-05-07T19:43:49.9979152Z Attempting uninstall: pip 2025-05-07T19:43:49.9979437Z Found existing installation: pip 25.1 2025-05-07T19:43:49.9979765Z Uninstalling pip-25.1: 2025-05-07T19:43:49.9980043Z Successfully uninstalled pip-25.1 2025-05-07T19:43:49.9980376Z Successfully installed pip-25.1.1 2025-05-07T19:43:49.9980573Z 2025-05-07T19:43:50.0566150Z [SETUP] Upgrading pyOpenSSL ... 
2025-05-07T19:43:50.0589415Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:50.7205073Z Channels: 2025-05-07T19:43:50.7205783Z - conda-forge 2025-05-07T19:43:50.7206451Z Platform: linux-64 2025-05-07T19:44:00.3824820Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:02.3174398Z Solving environment: | / - \ | done 2025-05-07T19:44:02.3598125Z 2025-05-07T19:44:02.3598604Z ## Package Plan ## 2025-05-07T19:44:02.3599315Z 2025-05-07T19:44:02.3600376Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:02.3602198Z 2025-05-07T19:44:02.3602614Z added / updated specs: 2025-05-07T19:44:02.3603417Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:02.3603998Z 2025-05-07T19:44:02.3604010Z 2025-05-07T19:44:02.3604363Z The following packages will be downloaded: 2025-05-07T19:44:02.3605000Z 2025-05-07T19:44:02.3605122Z package | build 2025-05-07T19:44:02.3605461Z ---------------------------|----------------- 2025-05-07T19:44:02.3605863Z cffi-1.17.1 | py311hf29c0ef_0 295 KB conda-forge 2025-05-07T19:44:02.3607210Z cryptography-44.0.3 | py311hafd3f86_0 1.5 MB conda-forge 2025-05-07T19:44:02.3607707Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:02.3608161Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:02.3608590Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:02.3609038Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:02.3609479Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:02.3609976Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:02.3610475Z python_abi-3.11 | 2_cp311 5 KB conda-forge 2025-05-07T19:44:02.3610973Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:02.3611526Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:02.3612289Z 
------------------------------------------------------------ 2025-05-07T19:44:02.3612925Z Total: 6.4 MB 2025-05-07T19:44:02.3613151Z 2025-05-07T19:44:02.3613295Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:02.3613558Z 2025-05-07T19:44:02.3613776Z cffi conda-forge/linux-64::cffi-1.17.1-py311hf29c0ef_0 2025-05-07T19:44:02.3614327Z cryptography conda-forge/linux-64::cryptography-44.0.3-py311hafd3f86_0 2025-05-07T19:44:02.3614859Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:02.3615360Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:02.3615866Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:02.3616370Z python_abi conda-forge/linux-64::python_abi-3.11-2_cp311 2025-05-07T19:44:02.3616979Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:02.3617598Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:02.3618327Z 2025-05-07T19:44:02.3618450Z The following packages will be UPDATED: 2025-05-07T19:44:02.3618659Z 2025-05-07T19:44:02.3619086Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:02.3619872Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:02.3620556Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:02.3621203Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:02.3621606Z 2025-05-07T19:44:02.3621610Z 2025-05-07T19:44:02.3621613Z 2025-05-07T19:44:02.3621767Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:02.4608506Z libgomp-15.1.0 | 442 KB | ########## | 100% 2025-05-07T19:44:02.4609150Z cryptography-44.0.3 | 1.5 MB | ########## | 100% 2025-05-07T19:44:02.4675926Z libgcc-15.1.0 | 810 KB | ########## | 100% 2025-05-07T19:44:02.5098809Z pyopenssl-25.0.0 | 120 KB | ########## | 100% 2025-05-07T19:44:02.5135555Z pycparser-2.22 | 108 KB | ########## | 100% 2025-05-07T19:44:02.5235813Z typing_extensions-4. | 51 KB | ########## | 100% 2025-05-07T19:44:02.5324897Z typing-extensions-4. | 88 KB | ########## | 100% 2025-05-07T19:44:02.5397811Z cffi-1.17.1 | 295 KB | ########## | 100% 2025-05-07T19:44:02.5498344Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:02.5530664Z python_abi-3.11 | 5 KB | ########## | 100% 2025-05-07T19:44:02.5659873Z libgcc-ng-15.1.0 | 34 KB | ########## | 100% 2025-05-07T19:44:02.7254974Z done 2025-05-07T19:44:02.8262016Z Preparing transaction: - done 2025-05-07T19:44:02.9273441Z Verifying transaction: | done 2025-05-07T19:44:04.3299355Z Executing transaction: - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:04.4244971Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:06.1188395Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:06.1207681Z [SETUP] Installing libxcrypt ...
2025-05-07T19:44:06.1233725Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:06.7917802Z Channels: 2025-05-07T19:44:06.7919257Z - conda-forge 2025-05-07T19:44:06.7919987Z Platform: linux-64 2025-05-07T19:44:09.8855873Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:10.3145123Z Solving environment: \ done 2025-05-07T19:44:10.3593540Z 2025-05-07T19:44:10.3594261Z ## Package Plan ## 2025-05-07T19:44:10.3595272Z 2025-05-07T19:44:10.3595901Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:10.3596852Z 2025-05-07T19:44:10.3597128Z added / updated specs: 2025-05-07T19:44:10.3597851Z - libxcrypt 2025-05-07T19:44:10.3598227Z 2025-05-07T19:44:10.3598239Z 2025-05-07T19:44:10.3598586Z The following packages will be downloaded: 2025-05-07T19:44:10.3599244Z 2025-05-07T19:44:10.3599671Z package | build 2025-05-07T19:44:10.3600008Z ---------------------------|----------------- 2025-05-07T19:44:10.3600647Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:10.3601048Z ------------------------------------------------------------ 2025-05-07T19:44:10.3601406Z Total: 98 KB 2025-05-07T19:44:10.3601613Z 2025-05-07T19:44:10.3601757Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:10.3601976Z 2025-05-07T19:44:10.3602210Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:10.3602516Z 2025-05-07T19:44:10.3602520Z 2025-05-07T19:44:10.3602523Z 2025-05-07T19:44:10.3602661Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:10.4808198Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:10.4809344Z done 2025-05-07T19:44:10.5816530Z Preparing transaction: / done 2025-05-07T19:44:10.6824054Z Verifying transaction: \ done 2025-05-07T19:44:10.7832189Z Executing transaction: / done 2025-05-07T19:44:14.0356432Z [SETUP] Copying over ... 2025-05-07T19:44:14.0359344Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.11/crypt.h 2025-05-07T19:44:14.0359963Z 2025-05-07T19:44:14.0384320Z 2025-05-07T19:44:15.6368207Z [SETUP] Installed Python version: Python 3.11.11 2025-05-07T19:44:15.6368980Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:15.6437691Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:15.6438181Z .
$PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:15.6438780Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:15.6439228Z env: 2025-05-07T19:44:15.6439446Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:15.6439765Z BUILD_ENV: build_binary 2025-05-07T19:44:15.6440010Z BUILD_TARGET: default 2025-05-07T19:44:15.6440272Z BUILD_VARIANT: cuda 2025-05-07T19:44:15.6440513Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:44:15.6440780Z ##[endgroup] 2025-05-07T19:44:16.1047136Z ################################################################################ 2025-05-07T19:44:16.1048273Z # Install C/C++ Compilers 2025-05-07T19:44:16.1048538Z # 2025-05-07T19:44:16.1064465Z # [2025-05-07T19:44:16.105Z] + install_cxx_compiler build_binary clang 2025-05-07T19:44:16.1065874Z ################################################################################ 2025-05-07T19:44:16.1066556Z 2025-05-07T19:44:16.1077451Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:16.1883807Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:16.1889736Z [INSTALL] Installing GLIBC (architecture = 64) ... 
2025-05-07T19:44:16.1911952Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:16.8510998Z Channels: 2025-05-07T19:44:16.8511717Z - conda-forge 2025-05-07T19:44:16.8512363Z Platform: linux-64 2025-05-07T19:44:19.9148658Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:20.3356770Z Solving environment: \ done 2025-05-07T19:44:20.3804499Z 2025-05-07T19:44:20.3805466Z ## Package Plan ## 2025-05-07T19:44:20.3805997Z 2025-05-07T19:44:20.3806601Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:20.3807523Z 2025-05-07T19:44:20.3807796Z added / updated specs: 2025-05-07T19:44:20.3808594Z - sysroot_linux-64=2.17 2025-05-07T19:44:20.3809094Z 2025-05-07T19:44:20.3809107Z 2025-05-07T19:44:20.3809489Z The following packages will be downloaded: 2025-05-07T19:44:20.3810023Z 2025-05-07T19:44:20.3810145Z package | build 2025-05-07T19:44:20.3810499Z ---------------------------|----------------- 2025-05-07T19:44:20.3810943Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:20.3811507Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:20.3811972Z ------------------------------------------------------------ 2025-05-07T19:44:20.3812337Z Total: 15.4 MB 2025-05-07T19:44:20.3812563Z 2025-05-07T19:44:20.3812718Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:20.3812954Z 2025-05-07T19:44:20.3813263Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:20.3814014Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:20.3814340Z 2025-05-07T19:44:20.3814344Z 2025-05-07T19:44:20.3814348Z 2025-05-07T19:44:20.3814628Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:20.6215243Z kernel-headers_linux | 921 KB | ########## | 100% 2025-05-07T19:44:21.2276895Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:21.2280869Z done 2025-05-07T19:44:21.3289763Z Preparing transaction: / done 2025-05-07T19:44:21.5297319Z Verifying transaction: \ | done 2025-05-07T19:44:21.6304697Z Executing transaction: - done 2025-05-07T19:44:21.7117831Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:21.7118687Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:23.3298648Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:23.3308806Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ...
2025-05-07T19:44:23.3330862Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0
2025-05-07T19:44:24.0282712Z Channels:
2025-05-07T19:44:24.0283136Z - conda-forge
2025-05-07T19:44:24.0283483Z Platform: linux-64
2025-05-07T19:44:27.1333611Z Collecting package metadata (repodata.json): done
2025-05-07T19:44:28.2686938Z Solving environment: done
2025-05-07T19:44:28.3174747Z ## Package Plan ##
2025-05-07T19:44:28.3176538Z environment location: /github/home/miniconda/envs/build_binary
2025-05-07T19:44:28.3177205Z added / updated specs:
2025-05-07T19:44:28.3177478Z - gxx_linux-64=11.4.0
2025-05-07T19:44:28.3177811Z The following packages will be downloaded:
2025-05-07T19:44:28.3178228Z package | build
2025-05-07T19:44:28.3178603Z ---------------------------|-----------------
2025-05-07T19:44:28.3179085Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge
2025-05-07T19:44:28.3179599Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge
2025-05-07T19:44:28.3180164Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge
2025-05-07T19:44:28.3180639Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge
2025-05-07T19:44:28.3181128Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge
2025-05-07T19:44:28.3181597Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge
2025-05-07T19:44:28.3182075Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge
2025-05-07T19:44:28.3182588Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge
2025-05-07T19:44:28.3183095Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge
2025-05-07T19:44:28.3183579Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge
2025-05-07T19:44:28.3184084Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge
2025-05-07T19:44:28.3184615Z libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge
2025-05-07T19:44:28.3185065Z ------------------------------------------------------------
2025-05-07T19:44:28.3185427Z Total: 91.6 MB
2025-05-07T19:44:28.3186449Z The following NEW packages will be INSTALLED:
2025-05-07T19:44:28.3186983Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7
2025-05-07T19:44:28.3187610Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4
2025-05-07T19:44:28.3188206Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13
2025-05-07T19:44:28.3188901Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4
2025-05-07T19:44:28.3189473Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13
2025-05-07T19:44:28.3190017Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4
2025-05-07T19:44:28.3190606Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113
2025-05-07T19:44:28.3191226Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13
2025-05-07T19:44:28.3191766Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2
2025-05-07T19:44:28.3192374Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113
2025-05-07T19:44:28.3192981Z The following packages will be UPDATED:
2025-05-07T19:44:28.3193610Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7
2025-05-07T19:44:28.3194428Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2
2025-05-07T19:44:28.3195045Z Downloading and Extracting Packages: ...working...
2025-05-07T19:44:28.7364572Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%
2025-05-07T19:44:28.8714760Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%
2025-05-07T19:44:28.9147954Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%
2025-05-07T19:44:28.9293834Z binutils_impl_linux- | 6.0 MB | ########## | 100%
2025-05-07T19:44:28.9487359Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%
2025-05-07T19:44:28.9657279Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%
2025-05-07T19:44:28.9678085Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%
2025-05-07T19:44:28.9701280Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%
2025-05-07T19:44:29.0024947Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%
2025-05-07T19:44:29.0088954Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%
2025-05-07T19:44:29.0174064Z binutils_linux-64-2. | 28 KB | ########## | 100%
2025-05-07T19:44:30.3146609Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100%
2025-05-07T19:44:30.3158497Z done
2025-05-07T19:44:30.4164750Z Preparing transaction: done
2025-05-07T19:44:30.7175548Z Verifying transaction: done
2025-05-07T19:44:30.8187551Z Executing transaction: done
2025-05-07T19:44:30.9093478Z [INSTALL] Setting the C/C++ compiler symlinks ...
2025-05-07T19:44:34.5571208Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc
2025-05-07T19:44:34.5599924Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc
2025-05-07T19:44:34.5631032Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++
2025-05-07T19:44:34.5660504Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++
2025-05-07T19:44:34.5679401Z [INSTALL] Installing Clang (16.0.6, 64) and relevant libraries through Conda ...
2025-05-07T19:44:34.5704411Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y clangxx=16.0.6 libcxx llvm-openmp=16.0.6 compiler-rt=16.0.6
2025-05-07T19:44:35.2684896Z Channels:
2025-05-07T19:44:35.2686489Z - conda-forge
2025-05-07T19:44:35.2687672Z Platform: linux-64
2025-05-07T19:44:38.3971566Z Collecting package metadata (repodata.json): done
2025-05-07T19:44:39.7481115Z Solving environment: done
2025-05-07T19:44:39.7999236Z ## Package Plan ##
2025-05-07T19:44:39.8001229Z environment location: /github/home/miniconda/envs/build_binary
2025-05-07T19:44:39.8001750Z added / updated specs:
2025-05-07T19:44:39.8002031Z - clangxx=16.0.6
2025-05-07T19:44:39.8002300Z - compiler-rt=16.0.6
2025-05-07T19:44:39.8002567Z - libcxx
2025-05-07T19:44:39.8002810Z - llvm-openmp=16.0.6
2025-05-07T19:44:39.8003205Z The following packages will be downloaded:
2025-05-07T19:44:39.8003935Z package | build
2025-05-07T19:44:39.8004310Z ---------------------------|-----------------
2025-05-07T19:44:39.8004708Z clang-16.0.6 |default_h9e3a008_14 110 KB conda-forge
2025-05-07T19:44:39.8005217Z clang-16-16.0.6 |default_hb5137d0_14 780 KB conda-forge
2025-05-07T19:44:39.8005695Z clangxx-16.0.6 |default_ha78316a_14 110 KB conda-forge
2025-05-07T19:44:39.8006194Z compiler-rt-16.0.6 | h00ab1b0_2 107 KB conda-forge
2025-05-07T19:44:39.8006693Z compiler-rt_linux-64-16.0.6| h00ab1b0_2 36.0 MB conda-forge
2025-05-07T19:44:39.8007174Z icu-73.2 | h59595ed_0 11.5 MB conda-forge
2025-05-07T19:44:39.8008042Z libclang-cpp16-16.0.6 |default_hb5137d0_14 17.3 MB conda-forge
2025-05-07T19:44:39.8008516Z libcxx-19.1.7 | h2713693_1 1000 KB conda-forge
2025-05-07T19:44:39.8008973Z libcxxabi-19.1.7 | hd85fd95_1 158 KB conda-forge
2025-05-07T19:44:39.8009525Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge
2025-05-07T19:44:39.8010269Z libllvm16-16.0.6 | hb3ce162_3 33.7 MB conda-forge
2025-05-07T19:44:39.8011041Z libxml2-2.12.7 | hc051c1a_1 688 KB conda-forge
2025-05-07T19:44:39.8011623Z libzlib-1.2.13 | h4ab18f5_6 60 KB conda-forge
2025-05-07T19:44:39.8012086Z llvm-openmp-16.0.6 | h4dfa4b3_0 39.9 MB conda-forge
2025-05-07T19:44:39.8012519Z zlib-1.2.13 | h4ab18f5_6 91 KB conda-forge
2025-05-07T19:44:39.8012938Z zstd-1.5.6 | ha6fb4c9_0 542 KB conda-forge
2025-05-07T19:44:39.8013330Z ------------------------------------------------------------
2025-05-07T19:44:39.8013699Z Total: 142.6 MB
2025-05-07T19:44:39.8014076Z The following NEW packages will be INSTALLED:
2025-05-07T19:44:39.8014538Z clang conda-forge/linux-64::clang-16.0.6-default_h9e3a008_14
2025-05-07T19:44:39.8015064Z clang-16 conda-forge/linux-64::clang-16-16.0.6-default_hb5137d0_14
2025-05-07T19:44:39.8015582Z clangxx conda-forge/linux-64::clangxx-16.0.6-default_ha78316a_14
2025-05-07T19:44:39.8016120Z compiler-rt conda-forge/linux-64::compiler-rt-16.0.6-h00ab1b0_2
2025-05-07T19:44:39.8016701Z compiler-rt_linux~ conda-forge/noarch::compiler-rt_linux-64-16.0.6-h00ab1b0_2
2025-05-07T19:44:39.8017299Z icu conda-forge/linux-64::icu-73.2-h59595ed_0
2025-05-07T19:44:39.8017804Z libclang-cpp16 conda-forge/linux-64::libclang-cpp16-16.0.6-default_hb5137d0_14
2025-05-07T19:44:39.8018317Z libcxx conda-forge/linux-64::libcxx-19.1.7-h2713693_1
2025-05-07T19:44:39.8018786Z libcxxabi conda-forge/linux-64::libcxxabi-19.1.7-hd85fd95_1
2025-05-07T19:44:39.8019259Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1
2025-05-07T19:44:39.8019710Z libllvm16 conda-forge/linux-64::libllvm16-16.0.6-hb3ce162_3
2025-05-07T19:44:39.8020174Z libxml2 conda-forge/linux-64::libxml2-2.12.7-hc051c1a_1
2025-05-07T19:44:39.8020606Z libzlib conda-forge/linux-64::libzlib-1.2.13-h4ab18f5_6
2025-05-07T19:44:39.8021088Z llvm-openmp conda-forge/linux-64::llvm-openmp-16.0.6-h4dfa4b3_0
2025-05-07T19:44:39.8023861Z zstd conda-forge/linux-64::zstd-1.5.6-ha6fb4c9_0
2025-05-07T19:44:39.8024250Z The following packages will be UPDATED:
2025-05-07T19:44:39.8024819Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.2.13-h4ab18f5_6
2025-05-07T19:44:39.8025711Z Downloading and Extracting Packages: ...working...
2025-05-07T19:44:39.8026126Z llvm-openmp-16.0.6 | 39.9 MB | | 0%
2025-05-07T19:44:39.8026726Z compiler-rt_linux-64 | 36.0 MB | | 0%
2025-05-07T19:44:39.8050297Z libllvm16-16.0.6 | 33.7 MB | | 0%
2025-05-07T19:44:39.8052950Z libclang-cpp16-16.0.
| 17.3 MB | | 0%
2025-05-07T19:44:40.4010495Z libllvm16-16.0.6 | 33.7 MB | #######7 | 77%
2025-05-07T19:44:40.4618785Z icu-73.2 | 11.5 MB | ########## | 100%
2025-05-07T19:44:40.4743457Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%
2025-05-07T19:44:40.5075077Z compiler-rt_linux-64 | 36.0 MB | ######### | 90%
2025-05-07T19:44:40.5705633Z clang-16-16.0.6 | 780 KB | ########## | 100%
2025-05-07T19:44:40.5744075Z libcxx-19.1.7 | 1000 KB | ########## | 100%
2025-05-07T19:44:40.6555250Z libiconv-1.18 | 696 KB | ########## | 100%
2025-05-07T19:44:40.6744853Z libxml2-2.12.7 | 688 KB | ########## | 100%
2025-05-07T19:44:40.7218649Z zstd-1.5.6 | 542 KB | ########## | 100%
2025-05-07T19:44:40.7747318Z libcxxabi-19.1.7 | 158 KB | ########## | 100%
2025-05-07T19:44:40.7825855Z llvm-openmp-16.0.6 | 39.9 MB | #########7 | 97%
2025-05-07T19:44:40.7912881Z clangxx-16.0.6 | 110 KB | ########## | 100%
2025-05-07T19:44:40.8366479Z clang-16.0.6 | 110 KB | ########## | 100%
2025-05-07T19:44:40.8388354Z zlib-1.2.13 | 91 KB | #7 | 18%
2025-05-07T19:44:40.8388689Z 2025-05-07T19:44:40.8388693Z 2025-05-07T19:44:40.8388696Z 2025-05-07T19:44:40.8388700Z 2025-05-07T19:44:40.8388703Z 2025-05-07T19:44:40.8388722Z 2025-05-07T19:44:40.8388725Z 2025-05-07T19:44:40.8388729Z 2025-05-07T19:44:40.8388732Z 2025-05-07T19:44:40.8388736Z 2025-05-07T19:44:40.8388739Z 2025-05-07T19:44:40.8388742Z 2025-05-07T19:44:40.8389914Z compiler-rt-16.0.6 | 107 KB | #4 | 15%  2025-05-07T19:44:40.8390244Z 2025-05-07T19:44:40.8390263Z 2025-05-07T19:44:40.8390274Z 2025-05-07T19:44:40.8390278Z 2025-05-07T19:44:40.8390281Z 2025-05-07T19:44:40.8390285Z 2025-05-07T19:44:40.8390288Z 2025-05-07T19:44:40.8390292Z 2025-05-07T19:44:40.8390295Z 2025-05-07T19:44:40.8390298Z 2025-05-07T19:44:40.8390302Z 2025-05-07T19:44:40.8390305Z 2025-05-07T19:44:40.8390308Z 2025-05-07T19:44:40.8390312Z 2025-05-07T19:44:40.8430242Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:40.8430650Z 2025-05-07T19:44:40.8430655Z 2025-05-07T19:44:40.8430659Z 2025-05-07T19:44:40.8430662Z 2025-05-07T19:44:40.8430665Z 2025-05-07T19:44:40.8430669Z 2025-05-07T19:44:40.8430672Z 2025-05-07T19:44:40.8430676Z 2025-05-07T19:44:40.8430679Z 2025-05-07T19:44:40.8430683Z 2025-05-07T19:44:40.8430686Z 2025-05-07T19:44:40.8430689Z 2025-05-07T19:44:40.8430693Z 2025-05-07T19:44:40.8613065Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:40.8614094Z 2025-05-07T19:44:40.8614108Z 2025-05-07T19:44:40.8614119Z 2025-05-07T19:44:40.8614129Z 2025-05-07T19:44:40.8614139Z 2025-05-07T19:44:40.8614149Z 2025-05-07T19:44:40.8614882Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:40.8615688Z 2025-05-07T19:44:40.8615699Z 2025-05-07T19:44:40.8615709Z 2025-05-07T19:44:40.8615720Z 2025-05-07T19:44:40.8615730Z 2025-05-07T19:44:40.8616156Z 2025-05-07T19:44:40.8909597Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:40.8910012Z 2025-05-07T19:44:40.8910255Z 2025-05-07T19:44:40.8910267Z 2025-05-07T19:44:40.8910273Z 2025-05-07T19:44:40.8910277Z 
2025-05-07T19:44:40.8910281Z 2025-05-07T19:44:40.8910317Z 2025-05-07T19:44:40.8910324Z 2025-05-07T19:44:40.8910327Z 2025-05-07T19:44:40.8910331Z 2025-05-07T19:44:40.8910345Z 2025-05-07T19:44:40.8910364Z 2025-05-07T19:44:40.8910367Z 2025-05-07T19:44:40.8910371Z 2025-05-07T19:44:40.8910374Z 2025-05-07T19:44:40.8926856Z libzlib-1.2.13 | 60 KB | ##6 | 27%  2025-05-07T19:44:40.8927506Z 2025-05-07T19:44:40.8927553Z 2025-05-07T19:44:40.8927560Z 2025-05-07T19:44:40.8927564Z 2025-05-07T19:44:40.8927567Z 2025-05-07T19:44:40.8927571Z 2025-05-07T19:44:40.8927574Z 2025-05-07T19:44:40.8927578Z 2025-05-07T19:44:40.8927581Z 2025-05-07T19:44:40.8927584Z 2025-05-07T19:44:40.8927588Z 2025-05-07T19:44:40.8927607Z 2025-05-07T19:44:40.8927611Z 2025-05-07T19:44:40.8927614Z 2025-05-07T19:44:40.8927617Z 2025-05-07T19:44:40.8942057Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:40.8942420Z 2025-05-07T19:44:40.8942591Z 2025-05-07T19:44:40.8943089Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:40.8943396Z 2025-05-07T19:44:40.8943401Z 2025-05-07T19:44:40.9041676Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:40.9042557Z 2025-05-07T19:44:40.9042571Z 2025-05-07T19:44:40.9042582Z 2025-05-07T19:44:40.9042592Z 2025-05-07T19:44:40.9042603Z 2025-05-07T19:44:40.9043339Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:40.9043614Z 2025-05-07T19:44:40.9043618Z 2025-05-07T19:44:40.9043622Z 2025-05-07T19:44:40.9043626Z 2025-05-07T19:44:40.9043645Z 2025-05-07T19:44:40.9209118Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:40.9209700Z 2025-05-07T19:44:40.9416809Z compiler-rt_linux-64 | 36.0 MB | ########## | 100%  2025-05-07T19:44:40.9417127Z 2025-05-07T19:44:40.9417132Z 2025-05-07T19:44:40.9417136Z 2025-05-07T19:44:40.9417139Z 2025-05-07T19:44:40.9417143Z 2025-05-07T19:44:40.9417147Z 2025-05-07T19:44:40.9417150Z 2025-05-07T19:44:40.9417412Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:40.9417740Z 
2025-05-07T19:44:40.9417744Z 2025-05-07T19:44:40.9417748Z 2025-05-07T19:44:40.9417752Z 2025-05-07T19:44:40.9417755Z 2025-05-07T19:44:40.9417759Z 2025-05-07T19:44:40.9417762Z 2025-05-07T19:44:40.9755873Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:40.9756829Z 2025-05-07T19:44:40.9756844Z 2025-05-07T19:44:40.9756855Z 2025-05-07T19:44:40.9756866Z 2025-05-07T19:44:40.9756876Z 2025-05-07T19:44:40.9756886Z 2025-05-07T19:44:40.9756896Z 2025-05-07T19:44:40.9756906Z 2025-05-07T19:44:40.9757689Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:40.9758543Z 2025-05-07T19:44:40.9758554Z 2025-05-07T19:44:40.9758564Z 2025-05-07T19:44:40.9758574Z 2025-05-07T19:44:40.9758584Z 2025-05-07T19:44:40.9758594Z 2025-05-07T19:44:40.9758604Z 2025-05-07T19:44:40.9758614Z 2025-05-07T19:44:40.9822833Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:40.9823288Z 2025-05-07T19:44:40.9823553Z 2025-05-07T19:44:40.9823557Z 2025-05-07T19:44:40.9823561Z 2025-05-07T19:44:41.0014495Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:41.0014965Z 2025-05-07T19:44:41.0014976Z 2025-05-07T19:44:41.0014994Z 2025-05-07T19:44:41.0014997Z 2025-05-07T19:44:41.0015045Z 2025-05-07T19:44:41.0015049Z 2025-05-07T19:44:41.0015052Z 2025-05-07T19:44:41.0015056Z 2025-05-07T19:44:41.0015066Z 2025-05-07T19:44:41.0015070Z 2025-05-07T19:44:41.0015665Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:41.0015996Z 2025-05-07T19:44:41.0016001Z 2025-05-07T19:44:41.0016005Z 2025-05-07T19:44:41.0016009Z 2025-05-07T19:44:41.0016012Z 2025-05-07T19:44:41.0016016Z 2025-05-07T19:44:41.0016019Z 2025-05-07T19:44:41.0016050Z 2025-05-07T19:44:41.0016053Z 2025-05-07T19:44:41.0016057Z 2025-05-07T19:44:41.0036874Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:41.0037858Z 2025-05-07T19:44:41.0037873Z 2025-05-07T19:44:41.0037885Z 2025-05-07T19:44:41.0037895Z 2025-05-07T19:44:41.0037929Z 2025-05-07T19:44:41.0037940Z 2025-05-07T19:44:41.0037950Z 
2025-05-07T19:44:41.0037961Z 2025-05-07T19:44:41.0037970Z 2025-05-07T19:44:41.0038703Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:41.0039502Z 2025-05-07T19:44:41.0039513Z 2025-05-07T19:44:41.0039523Z 2025-05-07T19:44:41.0039533Z 2025-05-07T19:44:41.0039565Z 2025-05-07T19:44:41.0039575Z 2025-05-07T19:44:41.0039601Z 2025-05-07T19:44:41.0039612Z 2025-05-07T19:44:41.0039622Z 2025-05-07T19:44:41.0737377Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:41.0738286Z 2025-05-07T19:44:41.0738299Z 2025-05-07T19:44:41.0738336Z 2025-05-07T19:44:41.0738346Z 2025-05-07T19:44:41.0738356Z 2025-05-07T19:44:41.0738367Z 2025-05-07T19:44:41.0738377Z 2025-05-07T19:44:41.0738387Z 2025-05-07T19:44:41.0738398Z 2025-05-07T19:44:41.0738439Z 2025-05-07T19:44:41.0738450Z 2025-05-07T19:44:41.0738460Z 2025-05-07T19:44:41.0739278Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:41.0740209Z 2025-05-07T19:44:41.0740212Z 2025-05-07T19:44:41.0740216Z 2025-05-07T19:44:41.0740220Z 2025-05-07T19:44:41.0740223Z 2025-05-07T19:44:41.0740226Z 2025-05-07T19:44:41.0740230Z 2025-05-07T19:44:41.0740233Z 2025-05-07T19:44:41.0740237Z 2025-05-07T19:44:41.0740240Z 2025-05-07T19:44:41.0740243Z 2025-05-07T19:44:41.0740247Z 2025-05-07T19:44:41.0786701Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:41.0787728Z 2025-05-07T19:44:41.0787742Z 2025-05-07T19:44:41.0787753Z 2025-05-07T19:44:41.0787763Z 2025-05-07T19:44:41.0787773Z 2025-05-07T19:44:41.0787783Z 2025-05-07T19:44:41.0787793Z 2025-05-07T19:44:41.0787802Z 2025-05-07T19:44:41.0787813Z 2025-05-07T19:44:41.0787823Z 2025-05-07T19:44:41.0787833Z 2025-05-07T19:44:41.0788627Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:41.0789483Z 2025-05-07T19:44:41.0789494Z 2025-05-07T19:44:41.0789504Z 2025-05-07T19:44:41.0789515Z 2025-05-07T19:44:41.0789525Z 2025-05-07T19:44:41.0789535Z 2025-05-07T19:44:41.0789545Z 2025-05-07T19:44:41.0789555Z 2025-05-07T19:44:41.0789566Z 2025-05-07T19:44:41.0789576Z 
2025-05-07T19:44:41.0789586Z 2025-05-07T19:44:41.0921827Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:41.0922192Z 2025-05-07T19:44:41.0922197Z 2025-05-07T19:44:41.0922201Z 2025-05-07T19:44:41.0922205Z 2025-05-07T19:44:41.0922208Z 2025-05-07T19:44:41.0922212Z 2025-05-07T19:44:41.0922215Z 2025-05-07T19:44:41.0922219Z 2025-05-07T19:44:41.0922222Z 2025-05-07T19:44:41.0922226Z 2025-05-07T19:44:41.0922229Z 2025-05-07T19:44:41.0922232Z 2025-05-07T19:44:41.0922255Z 2025-05-07T19:44:41.0922258Z 2025-05-07T19:44:41.0922542Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:41.0923079Z 2025-05-07T19:44:41.0923084Z 2025-05-07T19:44:41.0923087Z 2025-05-07T19:44:41.0923091Z 2025-05-07T19:44:41.0923094Z 2025-05-07T19:44:41.0923097Z 2025-05-07T19:44:41.0923100Z 2025-05-07T19:44:41.0923104Z 2025-05-07T19:44:41.0923126Z 2025-05-07T19:44:41.0923130Z 2025-05-07T19:44:41.0923133Z 2025-05-07T19:44:41.0923137Z 2025-05-07T19:44:41.0923140Z 2025-05-07T19:44:41.0923143Z 2025-05-07T19:44:41.1055434Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:41.1056432Z 2025-05-07T19:44:41.1056445Z 2025-05-07T19:44:41.1056456Z 2025-05-07T19:44:41.1056466Z 2025-05-07T19:44:41.1056476Z 2025-05-07T19:44:41.1056487Z 2025-05-07T19:44:41.1056497Z 2025-05-07T19:44:41.1056508Z 2025-05-07T19:44:41.1056518Z 2025-05-07T19:44:41.1056528Z 2025-05-07T19:44:41.1056538Z 2025-05-07T19:44:41.1056548Z 2025-05-07T19:44:41.1056559Z 2025-05-07T19:44:41.1056569Z 2025-05-07T19:44:41.1056597Z 2025-05-07T19:44:41.1057460Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:41.1058392Z 2025-05-07T19:44:41.1058403Z 2025-05-07T19:44:41.1058413Z 2025-05-07T19:44:41.1058424Z 2025-05-07T19:44:41.1058434Z 2025-05-07T19:44:41.1058444Z 2025-05-07T19:44:41.1058454Z 2025-05-07T19:44:41.1058465Z 2025-05-07T19:44:41.1058475Z 2025-05-07T19:44:41.1058485Z 2025-05-07T19:44:41.1058495Z 2025-05-07T19:44:41.1058505Z 2025-05-07T19:44:41.1058515Z 2025-05-07T19:44:41.1058538Z 2025-05-07T19:44:41.1058550Z 
2025-05-07T19:44:41.1085271Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:41.1086541Z 2025-05-07T19:44:41.1086552Z 2025-05-07T19:44:41.1086562Z 2025-05-07T19:44:41.1086572Z 2025-05-07T19:44:41.1086582Z 2025-05-07T19:44:41.1086592Z 2025-05-07T19:44:41.1086603Z 2025-05-07T19:44:41.1086613Z 2025-05-07T19:44:41.1086623Z 2025-05-07T19:44:41.1086656Z 2025-05-07T19:44:41.1086667Z 2025-05-07T19:44:41.1086702Z 2025-05-07T19:44:41.1086713Z 2025-05-07T19:44:41.1087608Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:41.1088549Z 2025-05-07T19:44:41.1088560Z 2025-05-07T19:44:41.1088570Z 2025-05-07T19:44:41.1088580Z 2025-05-07T19:44:41.1088590Z 2025-05-07T19:44:41.1088601Z 2025-05-07T19:44:41.1088636Z 2025-05-07T19:44:41.1088646Z 2025-05-07T19:44:41.1088656Z 2025-05-07T19:44:41.1088666Z 2025-05-07T19:44:41.1088676Z 2025-05-07T19:44:41.1088697Z 2025-05-07T19:44:41.1088716Z 2025-05-07T19:44:41.1132994Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:41.1257637Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:41.1258468Z 2025-05-07T19:44:41.1258482Z 2025-05-07T19:44:41.1258493Z 2025-05-07T19:44:41.5140743Z libclang-cpp16-16.0. 
| 17.3 MB | ########## | 100%  2025-05-07T19:44:41.5141682Z 2025-05-07T19:44:41.5624638Z compiler-rt_linux-64 | 36.0 MB | ########## | 100%  2025-05-07T19:44:41.5625508Z 2025-05-07T19:44:41.5625522Z 2025-05-07T19:44:41.6130259Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:41.6135885Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:41.6137071Z 2025-05-07T19:44:41.6137713Z 2025-05-07T19:44:41.6138228Z  2025-05-07T19:44:41.6138847Z 2025-05-07T19:44:41.6138892Z 2025-05-07T19:44:41.6139397Z  2025-05-07T19:44:41.6140031Z 2025-05-07T19:44:41.6140043Z 2025-05-07T19:44:41.6140053Z 2025-05-07T19:44:41.6140543Z  2025-05-07T19:44:41.6141197Z 2025-05-07T19:44:41.6141209Z 2025-05-07T19:44:41.6141220Z 2025-05-07T19:44:41.6141231Z 2025-05-07T19:44:41.6141735Z  2025-05-07T19:44:41.6142760Z 2025-05-07T19:44:41.6142772Z 2025-05-07T19:44:41.6142782Z 2025-05-07T19:44:41.6142792Z 2025-05-07T19:44:41.6142802Z 2025-05-07T19:44:41.6143342Z  2025-05-07T19:44:41.6143751Z 2025-05-07T19:44:41.6143755Z 2025-05-07T19:44:41.6143758Z 2025-05-07T19:44:41.6143762Z 2025-05-07T19:44:41.6143766Z 2025-05-07T19:44:41.6143770Z 2025-05-07T19:44:41.6144174Z  2025-05-07T19:44:41.6144391Z 2025-05-07T19:44:41.6144394Z 2025-05-07T19:44:41.6144397Z 2025-05-07T19:44:41.6144400Z 2025-05-07T19:44:41.6144403Z 2025-05-07T19:44:41.6144406Z 2025-05-07T19:44:41.6144410Z 2025-05-07T19:44:41.6144584Z  2025-05-07T19:44:41.6144815Z 2025-05-07T19:44:41.6144818Z 2025-05-07T19:44:41.6144821Z 2025-05-07T19:44:41.6144825Z 2025-05-07T19:44:41.6144832Z 2025-05-07T19:44:41.6144836Z 2025-05-07T19:44:41.6144839Z 2025-05-07T19:44:41.6144842Z 2025-05-07T19:44:41.6145017Z  2025-05-07T19:44:41.6145252Z 2025-05-07T19:44:41.6145255Z 2025-05-07T19:44:41.6145259Z 2025-05-07T19:44:41.6145262Z 2025-05-07T19:44:41.6145265Z 2025-05-07T19:44:41.6145268Z 2025-05-07T19:44:41.6145271Z 2025-05-07T19:44:41.6145274Z 2025-05-07T19:44:41.6145277Z 2025-05-07T19:44:41.6145460Z  2025-05-07T19:44:41.6145694Z 
2025-05-07T19:44:41.6145697Z 2025-05-07T19:44:41.6145701Z 2025-05-07T19:44:41.6145704Z 2025-05-07T19:44:41.6145707Z 2025-05-07T19:44:41.6145710Z 2025-05-07T19:44:41.6145714Z 2025-05-07T19:44:41.6145717Z 2025-05-07T19:44:41.6145720Z 2025-05-07T19:44:41.6145723Z 2025-05-07T19:44:41.6145910Z  2025-05-07T19:44:41.6146151Z 2025-05-07T19:44:41.6146155Z 2025-05-07T19:44:41.6146158Z 2025-05-07T19:44:41.6146161Z 2025-05-07T19:44:41.6146164Z 2025-05-07T19:44:41.6146167Z 2025-05-07T19:44:41.6146170Z 2025-05-07T19:44:41.6146174Z 2025-05-07T19:44:41.6146177Z 2025-05-07T19:44:41.6146180Z 2025-05-07T19:44:41.6146183Z 2025-05-07T19:44:41.6146367Z  2025-05-07T19:44:41.6146610Z 2025-05-07T19:44:41.6146613Z 2025-05-07T19:44:41.6146617Z 2025-05-07T19:44:41.6146620Z 2025-05-07T19:44:41.6146627Z 2025-05-07T19:44:41.6146630Z 2025-05-07T19:44:41.6146633Z 2025-05-07T19:44:41.6146636Z 2025-05-07T19:44:41.6146639Z 2025-05-07T19:44:41.6146642Z 2025-05-07T19:44:41.6146646Z 2025-05-07T19:44:41.6146649Z 2025-05-07T19:44:41.6146840Z  2025-05-07T19:44:41.6147081Z 2025-05-07T19:44:41.6147085Z 2025-05-07T19:44:41.6147088Z 2025-05-07T19:44:41.6147091Z 2025-05-07T19:44:41.6147097Z 2025-05-07T19:44:41.6147101Z 2025-05-07T19:44:41.6147104Z 2025-05-07T19:44:41.6147107Z 2025-05-07T19:44:41.6147111Z 2025-05-07T19:44:41.6147114Z 2025-05-07T19:44:41.6147117Z 2025-05-07T19:44:41.6147120Z 2025-05-07T19:44:41.6147123Z 2025-05-07T19:44:41.6147334Z  2025-05-07T19:44:41.6147560Z 2025-05-07T19:44:41.6147563Z 2025-05-07T19:44:41.6147566Z 2025-05-07T19:44:41.6147570Z 2025-05-07T19:44:41.6147573Z 2025-05-07T19:44:41.6147579Z 2025-05-07T19:44:41.6147583Z 2025-05-07T19:44:41.6147586Z 2025-05-07T19:44:41.6147589Z 2025-05-07T19:44:41.6147592Z 2025-05-07T19:44:41.6147595Z 2025-05-07T19:44:41.6147598Z 2025-05-07T19:44:41.6147601Z 2025-05-07T19:44:41.6147604Z 2025-05-07T19:44:41.6147821Z  2025-05-07T19:44:41.6148050Z 2025-05-07T19:44:41.6148054Z 2025-05-07T19:44:41.6148140Z 2025-05-07T19:44:41.6148143Z 
2025-05-07T19:44:41.6148423Z done
2025-05-07T19:44:41.7146910Z Preparing transaction: \ done
2025-05-07T19:44:41.8154734Z Verifying transaction: / done
2025-05-07T19:44:41.9170881Z Executing transaction: \ done
2025-05-07T19:44:42.0066910Z [INSTALL] Setting the C/C++ compiler symlinks ...
2025-05-07T19:44:45.7195265Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc
2025-05-07T19:44:45.7229466Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc
2025-05-07T19:44:45.7260624Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++
2025-05-07T19:44:45.7287270Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++
2025-05-07T19:44:45.7302763Z + conda env config vars set -n build_binary CC=
2025-05-07T19:44:46.1414580Z + conda env config vars set -n build_binary CXX=
2025-05-07T19:44:46.5631077Z + conda run -n build_binary printenv CC
2025-05-07T19:44:48.3373184Z /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc
2025-05-07T19:44:48.3938522Z + conda run -n build_binary printenv CXX
2025-05-07T19:44:50.1571743Z /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++
2025-05-07T19:44:52.0737524Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib ...
2025-05-07T19:44:53.8618437Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. (See above for error)
2025-05-07T19:44:53.9317184Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib
2025-05-07T19:44:56.1533555Z /github/home/miniconda/envs/build_binary/bin/cc
2025-05-07T19:44:56.2250791Z [CHECK] Binary cc found in PATH
2025-05-07T19:44:58.0150608Z /github/home/miniconda/envs/build_binary/bin/gcc
2025-05-07T19:44:58.0715815Z [CHECK] Binary gcc found in PATH
2025-05-07T19:44:59.8687789Z /github/home/miniconda/envs/build_binary/bin/c++
2025-05-07T19:44:59.9400773Z [CHECK] Binary c++ found in PATH
2025-05-07T19:45:01.7288486Z /github/home/miniconda/envs/build_binary/bin/g++
2025-05-07T19:45:01.7858159Z [CHECK] Binary g++ found in PATH
2025-05-07T19:45:01.7859390Z [INFO] Printing out all preprocessor defines in the C compiler ...
2025-05-07T19:45:01.7860748Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:45:01.7864265Z 2025-05-07T19:45:03.5981933Z #define _LP64 1 2025-05-07T19:45:03.5982265Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:03.5982604Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:03.5982890Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:03.5983155Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:03.5983433Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:03.5983700Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:03.5983964Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:03.5984627Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:03.5984916Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:03.5985215Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:03.5985553Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:03.5986049Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:03.5986337Z #define __CHAR_BIT__ 8 2025-05-07T19:45:03.5986612Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:03.5986938Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:03.5987430Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:03.5987773Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:03.5988085Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:03.5988413Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:03.5988728Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:03.5989064Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:03.5989387Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:03.5989731Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:03.5990047Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:03.5990354Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:03.5990672Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:03.5990999Z #define __DBL_DIG__ 15 2025-05-07T19:45:03.5991277Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:03.5991597Z #define 
__DBL_HAS_DENORM__ 1 2025-05-07T19:45:03.5991881Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:03.5992156Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:03.5992440Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:03.5992807Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:03.5993096Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:03.5993374Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:03.5993712Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:03.5994013Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:03.5994291Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:03.5994634Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:03.5994934Z #define __ELF__ 1 2025-05-07T19:45:03.5995177Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:03.5995438Z #define __FLOAT128__ 1 2025-05-07T19:45:03.5995696Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:03.5996002Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:03.5996344Z #define __FLT16_DIG__ 3 2025-05-07T19:45:03.5996597Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:03.5996920Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:03.5997206Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:03.5997486Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:03.5997779Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:03.5998041Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:03.5998424Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:03.5998668Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:03.5998943Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:03.5999204Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:03.5999475Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:03.5999749Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:03.6000020Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:03.6000311Z #define __FLT_DIG__ 6 2025-05-07T19:45:03.6000538Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:03.6000823Z #define 
__FLT_HAS_DENORM__ 1 2025-05-07T19:45:03.6001070Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:03.6001333Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:03.6001587Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:03.6001843Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:03.6002093Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:03.6002353Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:03.6002621Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:03.6002892Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:03.6003159Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:03.6003415Z #define __FLT_RADIX__ 2 2025-05-07T19:45:03.6003772Z #define __FXSR__ 1 2025-05-07T19:45:03.6003995Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:03.6004285Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:03.6004593Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:03.6004914Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:03.6005210Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:03.6005508Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:03.6005809Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:03.6006147Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:03.6006450Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:03.6006750Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:03.6007065Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:03.6007372Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:03.6007679Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:03.6007977Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:03.6008312Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:03.6008635Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:03.6008962Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:03.6009270Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:03.6009519Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:03.6009795Z 
#define __GNUC_STDC_INLINE__ 1 2025-05-07T19:45:03.6010038Z #define __GNUC__ 4 2025-05-07T19:45:03.6010271Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:03.6010525Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:03.6010788Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:03.6011030Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:03.6011288Z #define __INT16_MAX__ 32767 2025-05-07T19:45:03.6011531Z #define __INT16_TYPE__ short 2025-05-07T19:45:03.6011794Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:03.6012043Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:03.6012275Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:03.6012525Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:03.6012773Z #define __INT32_TYPE__ int 2025-05-07T19:45:03.6013024Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:03.6013266Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:03.6013515Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:03.6013764Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:03.6014055Z #define __INT64_TYPE__ long int 2025-05-07T19:45:03.6014303Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:03.6014550Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:03.6014801Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:03.6015036Z #define __INT8_MAX__ 127 2025-05-07T19:45:03.6015293Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:03.6015643Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:03.6015909Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:03.6016157Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:03.6016430Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:03.6016738Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:03.6017008Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:03.6017253Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:03.6017532Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:03.6017811Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:03.6018103Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:03.6018370Z #define __INTPTR_WIDTH__ 64 
2025-05-07T19:45:03.6018614Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:03.6018882Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:03.6019138Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:03.6019408Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:03.6019678Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:03.6019944Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:03.6020194Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:03.6020473Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:03.6020766Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:03.6021275Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:03.6021804Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:03.6022268Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:03.6022780Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:03.6023125Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:03.6023439Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:03.6023709Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:03.6024000Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:03.6024270Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:03.6024579Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:03.6024875Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:03.6025246Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:03.6025533Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:03.6025830Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:03.6026127Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:03.6026410Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:03.6026699Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:03.6026967Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:03.6027252Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:03.6027549Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:03.6027836Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:03.6028107Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:03.6028397Z #define 
__INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:03.6028711Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:03.6029034Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:03.6029336Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:03.6029605Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:03.6029902Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:03.6030179Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:03.6030466Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:03.6030760Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:03.6031030Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:03.6031284Z #define __INT_WIDTH__ 32 2025-05-07T19:45:03.6031546Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:03.6031875Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:03.6032217Z #define __LDBL_DIG__ 18 2025-05-07T19:45:03.6032578Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:03.6032910Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:03.6033366Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:03.6033641Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:03.6033987Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:03.6034256Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:03.6034550Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:03.6034851Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:03.6035200Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:03.6035512Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:03.6035819Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:03.6036156Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:03.6036417Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:03.6036712Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:03.6037037Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:03.6037356Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:03.6037603Z #define __LP64__ 1 2025-05-07T19:45:03.6037835Z #define 
__MMX__ 1 2025-05-07T19:45:03.6038058Z #define __NO_INLINE__ 1 2025-05-07T19:45:03.6038320Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:03.6038599Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:03.6038904Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:03.6039262Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:03.6039583Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:03.6039933Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:03.6040262Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:03.6040591Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:03.6040884Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:03.6041200Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:03.6041488Z #define __PIC__ 2 2025-05-07T19:45:03.6041708Z #define __PIE__ 2 2025-05-07T19:45:03.6041952Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:03.6042328Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:03.6042640Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:03.6042915Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:03.6043217Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:03.6043530Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:03.6043834Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:03.6044107Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:03.6044387Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:03.6044635Z #define __SEG_FS 1 2025-05-07T19:45:03.6044952Z #define __SEG_GS 1 2025-05-07T19:45:03.6045296Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:03.6045532Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:03.6045791Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:03.6046066Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:03.6046332Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:03.6046576Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:03.6046834Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:03.6047072Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:03.6047326Z #define __SIZEOF_INT__ 4 
2025-05-07T19:45:03.6047563Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:03.6047838Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:03.6048098Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:03.6048330Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:03.6048596Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:03.6048843Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:03.6049091Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:03.6049334Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:03.6049591Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:03.6049830Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:03.6050083Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:03.6050320Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:03.6050574Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:03.6050837Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.6051132Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:03.6051421Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:03.6051655Z #define __SSE2_MATH__ 1 2025-05-07T19:45:03.6051886Z #define __SSE2__ 1 2025-05-07T19:45:03.6052090Z #define __SSE_MATH__ 1 2025-05-07T19:45:03.6052317Z #define __SSE__ 1 2025-05-07T19:45:03.6052521Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:03.6052770Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:03.6053000Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:03.6053249Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:03.6053496Z #define __STDC__ 1 2025-05-07T19:45:03.6053721Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:03.6053981Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:03.6054223Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:03.6054484Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:03.6054722Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:03.6054975Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:03.6055228Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:03.6055518Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:03.6055763Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:03.6056018Z #define 
__UINT32_FMTo__ "o" 2025-05-07T19:45:03.6056252Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:03.6056502Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:03.6056760Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:03.6057030Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:03.6057315Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:03.6057564Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:03.6057823Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:03.6058066Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:03.6058325Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:03.6058582Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.6058898Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:03.6059182Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:03.6059436Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:03.6059690Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:03.6059924Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:03.6060196Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:03.6060516Z #define __UINT8_MAX__ 255 2025-05-07T19:45:03.6060782Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:03.6061079Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:03.6061339Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:03.6061610Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:03.6061864Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:03.6062136Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:03.6062407Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.6062787Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:03.6063086Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:03.6063355Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:03.6063610Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:03.6063881Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:03.6064152Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:03.6064424Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.6064758Z #define __UINTPTR_TYPE__ long unsigned int 
2025-05-07T19:45:03.6065058Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:03.6065328Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:03.6065599Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:03.6065883Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:03.6066147Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:03.6066429Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:03.6066726Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:03.6067022Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:03.6067298Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:03.6067555Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:03.6067823Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:03.6068083Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:03.6068389Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:03.6068677Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:03.6068955Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:03.6069215Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:03.6069488Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:03.6069791Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.6070119Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:03.6070438Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:03.6070700Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:03.6070979Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:03.6071244Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:03.6071523Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:03.6071797Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:03.6072102Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:03.6072368Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:03.6072720Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:03.6073175Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:03.6073460Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:03.6073803Z #define 
__UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:03.6074127Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:03.6074438Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:03.6074724Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:03.6075024Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:03.6075313Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:03.6075648Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:03.6075982Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:03.6076267Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:03.6076565Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:03.6076851Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:03.6077178Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:03.6077539Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:03.6077882Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:03.6078167Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:03.6078464Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:03.6078745Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:03.6079118Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:03.6079428Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:03.6079742Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:03.6080391Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:03.6081038Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:03.6081325Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:03.6081583Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:03.6084591Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:03.6084922Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:03.6085223Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:03.6085499Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:03.6085915Z #define __amd64 1 2025-05-07T19:45:03.6086150Z #define __amd64__ 1 2025-05-07T19:45:03.6086367Z #define 
__clang__ 1 2025-05-07T19:45:03.6086638Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:03.6086948Z #define __clang_major__ 16 2025-05-07T19:45:03.6087223Z #define __clang_minor__ 0 2025-05-07T19:45:03.6087484Z #define __clang_patchlevel__ 6 2025-05-07T19:45:03.6088112Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:03.6088795Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:03.6089130Z #define __code_model_small__ 1 2025-05-07T19:45:03.6089408Z #define __gnu_linux__ 1 2025-05-07T19:45:03.6089643Z #define __k8 1 2025-05-07T19:45:03.6089872Z #define __k8__ 1 2025-05-07T19:45:03.6090089Z #define __linux 1 2025-05-07T19:45:03.6090325Z #define __linux__ 1 2025-05-07T19:45:03.6090547Z #define __llvm__ 1 2025-05-07T19:45:03.6090777Z #define __pic__ 2 2025-05-07T19:45:03.6090990Z #define __pie__ 2 2025-05-07T19:45:03.6091273Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:03.6091662Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:03.6092008Z #define __tune_k8__ 1 2025-05-07T19:45:03.6092253Z #define __unix 1 2025-05-07T19:45:03.6092464Z #define __unix__ 1 2025-05-07T19:45:03.6092691Z #define __x86_64 1 2025-05-07T19:45:03.6092905Z #define __x86_64__ 1 2025-05-07T19:45:03.6093143Z #define linux 1 2025-05-07T19:45:03.6093352Z #define unix 1 2025-05-07T19:45:03.6093498Z 2025-05-07T19:45:03.6553261Z 2025-05-07T19:45:03.6554278Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 
2025-05-07T19:45:03.6555191Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:45:03.6555485Z 2025-05-07T19:45:05.4738855Z #define _GNU_SOURCE 1 2025-05-07T19:45:05.4739704Z #define _LP64 1 2025-05-07T19:45:05.4740525Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:05.4740851Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:05.4741136Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:05.4741555Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:05.4741848Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:05.4742163Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:05.4742454Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:05.4742799Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:05.4743114Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:05.4743444Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:05.4743823Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:05.4744152Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:05.4744479Z #define __CHAR_BIT__ 8 2025-05-07T19:45:05.4744750Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:05.4745116Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:05.4745645Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:05.4746019Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:05.4746516Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:05.4746884Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:05.4747250Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:05.4747594Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:05.4747973Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:05.4748777Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:05.4749137Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:05.4749440Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:05.4749783Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:05.4750117Z #define __DBL_DIG__ 15 2025-05-07T19:45:05.4750416Z #define __DBL_EPSILON__ 
2.2204460492503131e-16 2025-05-07T19:45:05.4750742Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:05.4751048Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:05.4751459Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.4751746Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:05.4752048Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:05.4752333Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:05.4752764Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:05.4753264Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:05.4753628Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:05.4753906Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:05.4754258Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:05.4754582Z #define __DEPRECATED 1 2025-05-07T19:45:05.4754834Z #define __ELF__ 1 2025-05-07T19:45:05.4755075Z #define __EXCEPTIONS 1 2025-05-07T19:45:05.4755320Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:05.4755602Z #define __FLOAT128__ 1 2025-05-07T19:45:05.4755844Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:05.4756171Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:05.4756502Z #define __FLT16_DIG__ 3 2025-05-07T19:45:05.4756774Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:05.4757079Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:05.4757369Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:05.4757659Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.4757962Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:05.4758246Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:05.4758516Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:05.4758801Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:05.4759096Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:05.4759395Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:05.4759671Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:05.4759983Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:05.4760268Z #define __FLT_DENORM_MIN__ 
1.40129846e-45F 2025-05-07T19:45:05.4760582Z #define __FLT_DIG__ 6 2025-05-07T19:45:05.4760831Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:05.4761143Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:05.4761430Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:05.4761705Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.4761989Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:05.4762252Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:05.4762537Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:05.4762799Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:05.4763104Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:05.4763385Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:05.4763672Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:05.4763956Z #define __FLT_RADIX__ 2 2025-05-07T19:45:05.4764206Z #define __FXSR__ 1 2025-05-07T19:45:05.4764457Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:05.4764756Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:05.4765083Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:05.4765499Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:05.4765808Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:05.4766090Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:05.4766389Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:05.4766674Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:05.4766997Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:05.4767297Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:05.4767611Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:05.4767922Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:05.4768235Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:05.4768642Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:05.4768964Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:05.4769299Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:05.4769617Z #define 
__GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:05.4769943Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:45:05.4770230Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:45:05.4770535Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:45:05.4770787Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:05.4771161Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:05.4771413Z #define __GNUC__ 4 2025-05-07T19:45:05.4771638Z #define __GNUG__ 4 2025-05-07T19:45:05.4771879Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:05.4772160Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:45:05.4772472Z #define __GXX_RTTI 1 2025-05-07T19:45:05.4772691Z #define __GXX_WEAK__ 1 2025-05-07T19:45:05.4772940Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:05.4773187Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:05.4773457Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:05.4773700Z #define __INT16_MAX__ 32767 2025-05-07T19:45:05.4773967Z #define __INT16_TYPE__ short 2025-05-07T19:45:05.4774219Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:05.4774471Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:05.4774718Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:05.4774955Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:05.4775223Z #define __INT32_TYPE__ int 2025-05-07T19:45:05.4775459Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:05.4775719Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:05.4775955Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:05.4776220Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:05.4776501Z #define __INT64_TYPE__ long int 2025-05-07T19:45:05.4776763Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:05.4776999Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:05.4777251Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:05.4777507Z #define __INT8_MAX__ 127 2025-05-07T19:45:05.4777753Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:05.4778038Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:05.4778298Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:05.4778571Z #define 
__INTMAX_FMTi__ "li" 2025-05-07T19:45:05.4778825Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:05.4779128Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:05.4779388Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:05.4779646Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:05.4779892Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:05.4780170Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:05.4780472Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:05.4780739Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:05.4781001Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:05.4781262Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:05.4781535Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:05.4781797Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:05.4782078Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:05.4782334Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:05.4782606Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:05.4782867Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:05.4783158Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:05.4783425Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:05.4783682Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:05.4783958Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:05.4784237Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:05.4784562Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:05.4784835Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:05.4785105Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:05.4785365Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:05.4785637Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:05.4786266Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:05.4786784Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:05.4787076Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:05.4787493Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:05.4787801Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:05.4788086Z 
#define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:05.4788395Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:05.4788673Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:05.4788966Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:05.4789245Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:05.4789555Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:05.4789905Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:05.4790200Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:05.4790499Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:05.4790802Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:05.4791147Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:05.4791446Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:05.4791739Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:05.4792021Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:05.4792324Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:05.4792689Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:05.4793015Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:05.4793286Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:05.4793567Z #define __INT_WIDTH__ 32 2025-05-07T19:45:05.4793837Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:05.4794164Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:05.4794529Z #define __LDBL_DIG__ 18 2025-05-07T19:45:05.4794814Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:05.4795165Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:05.4795439Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:05.4795727Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:05.4796003Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:05.4796287Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:05.4796582Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:05.4796882Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:05.4797236Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:05.4797535Z 
#define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:05.4797861Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:05.4798190Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:05.4798583Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:05.4798860Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:05.4799205Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:05.4799501Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:05.4799769Z #define __LP64__ 1 2025-05-07T19:45:05.4800003Z #define __MMX__ 1 2025-05-07T19:45:05.4800227Z #define __NO_INLINE__ 1 2025-05-07T19:45:05.4800488Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:05.4800747Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:05.4801070Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:05.4801412Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:05.4801744Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:05.4802075Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:05.4802426Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:05.4802744Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:05.4803052Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:05.4803369Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:05.4803644Z #define __PIC__ 2 2025-05-07T19:45:05.4803876Z #define __PIE__ 2 2025-05-07T19:45:05.4804097Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:05.4804385Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:05.4804676Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:05.4804953Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:05.4820707Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:05.4821058Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:05.4821340Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:05.4821617Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:05.4821889Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:05.4822118Z #define __SEG_FS 1 2025-05-07T19:45:05.4822501Z #define __SEG_GS 1 
2025-05-07T19:45:05.4822714Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:05.4822978Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:05.4823228Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:05.4823525Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:05.4823781Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:05.4824050Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:05.4824303Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:05.4824559Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:05.4824822Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:05.4825121Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:05.4825410Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:05.4825663Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:05.4825925Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:05.4826178Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:05.4826446Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:05.4826685Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:05.4826949Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:05.4827198Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:05.4827455Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:05.4827710Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:05.4827950Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:05.4828202Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:05.4828451Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.4828762Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:05.4829039Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:05.4829284Z #define __SSE2_MATH__ 1 2025-05-07T19:45:05.4829507Z #define __SSE2__ 1 2025-05-07T19:45:05.4829735Z #define __SSE_MATH__ 1 2025-05-07T19:45:05.4829952Z #define __SSE__ 1 2025-05-07T19:45:05.4830213Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:45:05.4830537Z #define __STDCPP_THREADS__ 1 2025-05-07T19:45:05.4830787Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:05.4831037Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:05.4831270Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:05.4831513Z 
#define __STDC__ 1 2025-05-07T19:45:05.4831731Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:05.4831998Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:05.4832250Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:05.4832615Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:05.4832872Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:05.4833323Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:05.4833665Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:05.4833988Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:05.4834276Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:05.4834544Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:05.4834817Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:05.4835073Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:05.4835353Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:05.4835645Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:05.4835959Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:05.4836230Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:05.4836510Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:05.4836778Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:05.4837058Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:05.4837356Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.4837684Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:05.4838011Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:05.4838276Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:05.4838556Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:05.4838817Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:05.4839209Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:05.4839460Z #define __UINT8_MAX__ 255 2025-05-07T19:45:05.4839725Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:05.4840005Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:05.4840283Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:05.4840552Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:05.4840804Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:05.4841076Z #define __UINTMAX_FMTx__ "lx" 
2025-05-07T19:45:05.4841346Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.4841795Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:05.4842085Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:05.4842353Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:05.4842605Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:05.4842874Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:05.4843123Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:05.4843408Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.4843735Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:05.4844079Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:05.4844351Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:05.4844620Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:05.4844902Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:05.4845168Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:05.4845450Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:05.4845732Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:05.4846048Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:05.4846317Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:05.4846592Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:05.4846865Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:05.4847130Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:05.4847440Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:05.4847731Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:05.4848008Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:05.4848274Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:05.4848556Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:05.4848844Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.4849195Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:05.4849517Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:05.4849784Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:05.4850064Z #define 
__UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:05.4850327Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:05.4850606Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:05.4850878Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:05.4851187Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:05.4851463Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:05.4851752Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:05.4852023Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:05.4852303Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:05.4852602Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:05.4852902Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:05.4853189Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:05.4853455Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:05.4853737Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:05.4854008Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:05.4854321Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:05.4854615Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:05.4854904Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:05.4855194Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:05.4855460Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:05.4855771Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:05.4856110Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:05.4856430Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:05.4856693Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:05.4856979Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:05.4857240Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:05.4857524Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:05.4857797Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:05.4858106Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:05.4858718Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock 
db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:05.4859315Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:05.4859587Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:05.4859830Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:05.4860163Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:05.4860426Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:05.4860716Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:05.4860961Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:05.4861211Z #define __amd64 1 2025-05-07T19:45:05.4861434Z #define __amd64__ 1 2025-05-07T19:45:05.4861638Z #define __clang__ 1 2025-05-07T19:45:05.4861886Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:05.4862175Z #define __clang_major__ 16 2025-05-07T19:45:05.4862477Z #define __clang_minor__ 0 2025-05-07T19:45:05.4862721Z #define __clang_patchlevel__ 6 2025-05-07T19:45:05.4863301Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:05.4863924Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:05.4864252Z #define __code_model_small__ 1 2025-05-07T19:45:05.4864520Z #define __cplusplus 201703L 2025-05-07T19:45:05.4864782Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:45:05.4865090Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:45:05.4865377Z #define __cpp_alias_templates 200704L 2025-05-07T19:45:05.4865676Z #define __cpp_aligned_new 201606L 2025-05-07T19:45:05.4865946Z #define __cpp_attributes 200809L 2025-05-07T19:45:05.4866230Z #define __cpp_binary_literals 201304L 2025-05-07T19:45:05.4866516Z #define __cpp_capture_star_this 201603L 2025-05-07T19:45:05.4866825Z #define __cpp_constexpr 201603L 2025-05-07T19:45:05.4867107Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:45:05.4867415Z #define __cpp_decltype 200707L 2025-05-07T19:45:05.4867688Z #define __cpp_decltype_auto 201304L 2025-05-07T19:45:05.4867969Z #define __cpp_deduction_guides 201703L 
2025-05-07T19:45:05.4868288Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:45:05.4868757Z #define __cpp_digit_separators 201309L 2025-05-07T19:45:05.4869066Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:45:05.4869362Z #define __cpp_exceptions 199711L 2025-05-07T19:45:05.4869654Z #define __cpp_fold_expressions 201603L 2025-05-07T19:45:05.4869942Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:45:05.4870263Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:45:05.4870586Z #define __cpp_hex_float 201603L 2025-05-07T19:45:05.4870852Z #define __cpp_if_constexpr 201606L 2025-05-07T19:45:05.4871163Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:45:05.4871489Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:45:05.4871818Z #define __cpp_init_captures 201304L 2025-05-07T19:45:05.4872100Z #define __cpp_initializer_lists 200806L 2025-05-07T19:45:05.4872408Z #define __cpp_inline_variables 201606L 2025-05-07T19:45:05.4872771Z #define __cpp_lambdas 200907L 2025-05-07T19:45:05.4873251Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:45:05.4873645Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:45:05.4874025Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:45:05.4874414Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:45:05.4874762Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:45:05.4875149Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:45:05.4875502Z #define __cpp_nsdmi 200809L 2025-05-07T19:45:05.4875792Z #define __cpp_range_based_for 201603L 2025-05-07T19:45:05.4876096Z #define __cpp_raw_strings 200710L 2025-05-07T19:45:05.4876400Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:45:05.4876719Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:45:05.4877089Z #define __cpp_rtti 199711L 2025-05-07T19:45:05.4877379Z #define __cpp_rvalue_references 200610L 2025-05-07T19:45:05.4877693Z #define __cpp_static_assert 201411L 
2025-05-07T19:45:05.4878027Z #define __cpp_static_call_operator 202207L 2025-05-07T19:45:05.4878359Z #define __cpp_structured_bindings 201606L 2025-05-07T19:45:05.4878697Z #define __cpp_template_auto 201606L 2025-05-07T19:45:05.4879016Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:45:05.4879441Z #define __cpp_unicode_characters 200704L 2025-05-07T19:45:05.4879765Z #define __cpp_unicode_literals 200710L 2025-05-07T19:45:05.4880105Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:45:05.4880452Z #define __cpp_variable_templates 201304L 2025-05-07T19:45:05.4880776Z #define __cpp_variadic_templates 200704L 2025-05-07T19:45:05.4881112Z #define __cpp_variadic_using 201611L 2025-05-07T19:45:05.4881405Z #define __gnu_linux__ 1 2025-05-07T19:45:05.4881663Z #define __k8 1 2025-05-07T19:45:05.4881879Z #define __k8__ 1 2025-05-07T19:45:05.4882163Z #define __linux 1 2025-05-07T19:45:05.4882384Z #define __linux__ 1 2025-05-07T19:45:05.4882620Z #define __llvm__ 1 2025-05-07T19:45:05.4882834Z #define __pic__ 2 2025-05-07T19:45:05.4883062Z #define __pie__ 2 2025-05-07T19:45:05.4883288Z #define __private_extern__ extern 2025-05-07T19:45:05.4883631Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:05.4884031Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:05.4884368Z #define __tune_k8__ 1 2025-05-07T19:45:05.4884612Z #define __unix 1 2025-05-07T19:45:05.4884822Z #define __unix__ 1 2025-05-07T19:45:05.4885049Z #define __x86_64 1 2025-05-07T19:45:05.4885264Z #define __x86_64__ 1 2025-05-07T19:45:05.4885496Z #define linux 1 2025-05-07T19:45:05.4885899Z #define unix 1 2025-05-07T19:45:05.4886044Z 2025-05-07T19:45:05.5442789Z 2025-05-07T19:45:05.5443113Z + conda run -n build_binary c++ --version 2025-05-07T19:45:05.5443471Z 2025-05-07T19:45:07.3574222Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:45:07.3575017Z Target: x86_64-conda-linux-gnu 
2025-05-07T19:45:07.3575331Z Thread model: posix 2025-05-07T19:45:07.3575719Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:45:07.3576397Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:45:07.3576918Z 2025-05-07T19:45:07.4310040Z 2025-05-07T19:45:07.4311508Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:45:07.4313682Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:45:07.4314665Z 2025-05-07T19:45:09.3166518Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:09.3167059Z 2025-05-07T19:45:09.3167582Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:45:09.3168243Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:45:09.3168670Z 2025-05-07T19:45:11.2069684Z #define __cplusplus 201703L 2025-05-07T19:45:11.2073917Z 2025-05-07T19:45:11.2074373Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:45:11.2145513Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:11.2145986Z . 
$PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:11.2146821Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:11.2147172Z env: 2025-05-07T19:45:11.2147462Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:11.2147773Z BUILD_ENV: build_binary 2025-05-07T19:45:11.2148046Z BUILD_TARGET: default 2025-05-07T19:45:11.2148291Z BUILD_VARIANT: cuda 2025-05-07T19:45:11.2148587Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:45:11.2148868Z ##[endgroup] 2025-05-07T19:45:11.7079270Z ################################################################################ 2025-05-07T19:45:11.7079688Z # Install Build Tools 2025-05-07T19:45:11.7079943Z # 2025-05-07T19:45:11.7103383Z # [2025-05-07T19:45:11.709Z] + install_build_tools build_binary 2025-05-07T19:45:11.7103845Z ################################################################################ 2025-05-07T19:45:11.7104199Z 2025-05-07T19:45:11.7120447Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:11.7964554Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:11.7971934Z [INSTALL] Installing build tools ... 
2025-05-07T19:45:11.7996828Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:45:12.5147310Z Channels: 2025-05-07T19:45:12.5147954Z - conda-forge 2025-05-07T19:45:12.5148510Z Platform: linux-64 2025-05-07T19:45:15.6247956Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:19.3448487Z Solving environment: \ | / - done 2025-05-07T19:45:19.4043792Z 2025-05-07T19:45:19.4044271Z ## Package Plan ## 2025-05-07T19:45:19.4044745Z 2025-05-07T19:45:19.4045367Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:19.4046299Z 2025-05-07T19:45:19.4046673Z added / updated specs: 2025-05-07T19:45:19.4047382Z - auditwheel 2025-05-07T19:45:19.4048014Z - bazel 2025-05-07T19:45:19.4048606Z - cmake[version='>=3.30'] 2025-05-07T19:45:19.4049371Z - hypothesis 2025-05-07T19:45:19.4049977Z - jinja2 2025-05-07T19:45:19.4050595Z - make 2025-05-07T19:45:19.4051254Z - ncurses 2025-05-07T19:45:19.4051467Z - ninja 2025-05-07T19:45:19.4051890Z - openblas 2025-05-07T19:45:19.4052145Z - patchelf 2025-05-07T19:45:19.4052385Z - pyyaml 2025-05-07T19:45:19.4052595Z - rhash 2025-05-07T19:45:19.4052829Z - scikit-build 2025-05-07T19:45:19.4053054Z - wheel 2025-05-07T19:45:19.4053299Z 2025-05-07T19:45:19.4053306Z 2025-05-07T19:45:19.4053455Z The following packages will be downloaded: 2025-05-07T19:45:19.4053717Z 2025-05-07T19:45:19.4053836Z package | build 2025-05-07T19:45:19.4054196Z ---------------------------|----------------- 2025-05-07T19:45:19.4054596Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:19.4055064Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:19.4055521Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:19.4055989Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:19.4056427Z c-ares-1.34.5 | hb9d3cd8_0 
202 KB conda-forge 2025-05-07T19:45:19.4056843Z cairo-1.18.0 | hbb29018_2 961 KB conda-forge 2025-05-07T19:45:19.4057276Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:19.4057695Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:19.4058250Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:19.4059040Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:19.4059607Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:19.4060275Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:19.4060796Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:19.4061319Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:19.4061766Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:19.4062244Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:19.4062712Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:19.4063166Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:19.4063588Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:19.4064002Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:19.4064438Z harfbuzz-9.0.0 | hfac3d4d_0 1.5 MB conda-forge 2025-05-07T19:45:19.4064869Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:19.4065300Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:19.4065857Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:19.4066270Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:19.4066695Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:19.4067079Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:19.4067483Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:19.4067918Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:19.4068389Z libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:19.4068810Z 
libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:19.4069224Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:19.4069701Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:19.4070127Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:19.4070546Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:19.4070977Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:19.4071438Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:19.4071904Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:19.4072355Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:19.4073114Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:19.4073556Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:19.4074139Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:19.4074628Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:19.4075088Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:19.4075584Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:19.4076034Z libnsl-2.0.1 | hd590300_0 33 KB conda-forge 2025-05-07T19:45:19.4076522Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:19.4076994Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:19.4077590Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:19.4078083Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:19.4078542Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:19.4079010Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:19.4079547Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:19.4079969Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:19.4080365Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:19.4080800Z libwebp-base-1.5.0 | h851e524_0 420 KB 
conda-forge 2025-05-07T19:45:19.4081233Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:19.4081635Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:19.4082043Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:19.4082459Z markupsafe-3.0.2 | py311h2dc5d0c_1 25 KB conda-forge 2025-05-07T19:45:19.4082896Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:19.4083382Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:19.4083813Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:19.4084274Z openjdk-23.0.1 | h4c11d01_0 181.3 MB conda-forge 2025-05-07T19:45:19.4084694Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:19.4085133Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:19.4085535Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:19.4086463Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:19.4086949Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:19.4087424Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:19.4087922Z python-3.11.11 |h9e4cc4f_2_cpython 29.2 MB conda-forge 2025-05-07T19:45:19.4088380Z pyyaml-6.0.2 | py311h2dc5d0c_2 208 KB conda-forge 2025-05-07T19:45:19.4088834Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:19.4089257Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:19.4089733Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:19.4090228Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:19.4090729Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:19.4091224Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:19.4091653Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:19.4092221Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:19.4092774Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:19.4093203Z xorg-libice-1.1.2 | 
hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:19.4093665Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:19.4094100Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:19.4094561Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:19.4095015Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:19.4095629Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:19.4096103Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:19.4096553Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:19.4097026Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:19.4097496Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:19.4097981Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:19.4098442Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:19.4098853Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:19.4099280Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:19.4099713Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:19.4100132Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:19.4100512Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:19.4100909Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:19.4101394Z ------------------------------------------------------------ 2025-05-07T19:45:19.4101733Z Total: 336.5 MB 2025-05-07T19:45:19.4101947Z 2025-05-07T19:45:19.4102089Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:19.4102309Z 2025-05-07T19:45:19.4102504Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:19.4102947Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:19.4103412Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:19.4103854Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:19.4104277Z c-ares 
conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:19.4104683Z cairo conda-forge/linux-64::cairo-1.18.0-hbb29018_2 2025-05-07T19:45:19.4105103Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:19.4105512Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:19.4105938Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:19.4106453Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:19.4107045Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:19.4107670Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:19.4108273Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:19.4108860Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:19.4109380Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:19.4109876Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:19.4110381Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:19.4110842Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:19.4111290Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:19.4111751Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:19.4112219Z harfbuzz conda-forge/linux-64::harfbuzz-9.0.0-hfac3d4d_0 2025-05-07T19:45:19.4112846Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:19.4113612Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:19.4114075Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:19.4114663Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:19.4115110Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:19.4115550Z lcms2 
conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:19.4115980Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:19.4116495Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:19.4117032Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:19.4117495Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:19.4117992Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:19.4118515Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:19.4119021Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:19.4119587Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:19.4120057Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:19.4120577Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:19.4121078Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:19.4121688Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:19.4122178Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:19.4122612Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:19.4123106Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:19.4123581Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:19.4124066Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:19.4124580Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:19.4125021Z libnsl conda-forge/linux-64::libnsl-2.0.1-hd590300_0 2025-05-07T19:45:19.4125525Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:19.4126016Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:19.4126491Z libprotobuf 
conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:19.4126965Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:19.4127442Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:19.4127901Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:19.4128329Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:19.4128759Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:19.4129211Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:19.4129686Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:19.4130106Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:19.4130561Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py311h2dc5d0c_1 2025-05-07T19:45:19.4131047Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:19.4131508Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:19.4132007Z openjdk conda-forge/linux-64::openjdk-23.0.1-h4c11d01_0 2025-05-07T19:45:19.4132477Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:19.4132932Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:19.4133373Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:19.4133877Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:19.4134367Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:19.4134884Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:19.4135343Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py311h2dc5d0c_2 2025-05-07T19:45:19.4135835Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:19.4136256Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:19.4136718Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:19.4137228Z 
singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:19.4137746Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:19.4138267Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:19.4138739Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:19.4139215Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:19.4139907Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:19.4140591Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:19.4141229Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:19.4141778Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:19.4142353Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:19.4142919Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:19.4143455Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:19.4144066Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:19.4144625Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:19.4145168Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:19.4145724Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:19.4146230Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:19.4146704Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:19.4146973Z 2025-05-07T19:45:19.4147101Z The following packages will be UPDATED: 2025-05-07T19:45:19.4147343Z 2025-05-07T19:45:19.4147648Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:19.4148235Z libzlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:19.4148788Z ncurses 
pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:19.4149507Z python pkgs/main::python-3.11.11-he870216_0 --> conda-forge::python-3.11.11-h9e4cc4f_2_cpython 2025-05-07T19:45:19.4150218Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:19.4150915Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:19.4151573Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:19.4152045Z zlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:19.4152456Z zstd 1.5.6-ha6fb4c9_0 --> 1.5.7-hb8e6e7a_2 2025-05-07T19:45:19.4152825Z 2025-05-07T19:45:19.4153074Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:19.4153420Z 2025-05-07T19:45:19.4153665Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:19.4154037Z 2025-05-07T19:45:19.4154058Z 2025-05-07T19:45:19.4155767Z 2025-05-07T19:45:19.4155932Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:45:19.4156346Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:19.4156595Z 2025-05-07T19:45:19.4156909Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:19.4157182Z 2025-05-07T19:45:19.4157185Z 2025-05-07T19:45:19.4161021Z python-3.11.11 | 29.2 MB | | 0%  2025-05-07T19:45:19.4161290Z 2025-05-07T19:45:19.4161294Z 2025-05-07T19:45:19.4163159Z 2025-05-07T19:45:19.4186849Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:19.4187659Z 2025-05-07T19:45:19.4187674Z 2025-05-07T19:45:19.4187685Z 2025-05-07T19:45:19.4187696Z 2025-05-07T19:45:19.4191445Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:19.4191989Z 2025-05-07T19:45:19.4191993Z 2025-05-07T19:45:19.4191997Z 2025-05-07T19:45:19.4192000Z 2025-05-07T19:45:19.4192004Z 2025-05-07T19:45:19.4192267Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:19.4192661Z 2025-05-07T19:45:19.4192665Z 2025-05-07T19:45:19.4192668Z 2025-05-07T19:45:19.4192672Z 2025-05-07T19:45:19.4192675Z 2025-05-07T19:45:19.4192678Z 2025-05-07T19:45:19.4193322Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:19.4194693Z 2025-05-07T19:45:19.4194697Z 2025-05-07T19:45:19.4194701Z 2025-05-07T19:45:19.4194705Z 2025-05-07T19:45:19.4194708Z 2025-05-07T19:45:19.4194725Z 2025-05-07T19:45:19.4194728Z 2025-05-07T19:45:19.4194982Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:19.4195282Z 2025-05-07T19:45:19.4195285Z 2025-05-07T19:45:19.4195289Z 2025-05-07T19:45:19.4195292Z 2025-05-07T19:45:19.4195296Z 2025-05-07T19:45:19.4195299Z 2025-05-07T19:45:19.4195303Z 2025-05-07T19:45:19.4195306Z 2025-05-07T19:45:19.4203012Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:19.4203905Z 2025-05-07T19:45:19.4203916Z 2025-05-07T19:45:19.4203927Z 2025-05-07T19:45:19.4203936Z 2025-05-07T19:45:19.4203947Z 2025-05-07T19:45:19.4203957Z 2025-05-07T19:45:19.4203966Z 2025-05-07T19:45:19.4203976Z 2025-05-07T19:45:19.4203986Z 2025-05-07T19:45:19.4204757Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:19.4205676Z 2025-05-07T19:45:19.4205687Z 
2025-05-07T19:45:19.4205697Z 2025-05-07T19:45:19.4205707Z 2025-05-07T19:45:19.4205717Z 2025-05-07T19:45:19.4205728Z 2025-05-07T19:45:19.4205738Z 2025-05-07T19:45:19.4205749Z 2025-05-07T19:45:19.4205759Z 2025-05-07T19:45:19.4205769Z 2025-05-07T19:45:19.4206451Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:19.4207238Z 2025-05-07T19:45:19.4207250Z 2025-05-07T19:45:19.4207260Z 2025-05-07T19:45:19.4207270Z 2025-05-07T19:45:19.4207280Z 2025-05-07T19:45:19.4207290Z 2025-05-07T19:45:19.4207300Z 2025-05-07T19:45:19.4207310Z 2025-05-07T19:45:19.4207333Z 2025-05-07T19:45:19.4207344Z 2025-05-07T19:45:19.4207354Z 2025-05-07T19:45:19.4208169Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0%  2025-05-07T19:45:19.4209118Z 2025-05-07T19:45:19.4209129Z 2025-05-07T19:45:19.4209140Z 2025-05-07T19:45:19.4209163Z 2025-05-07T19:45:19.4209174Z 2025-05-07T19:45:19.4209184Z 2025-05-07T19:45:19.4209194Z 2025-05-07T19:45:19.4209204Z 2025-05-07T19:45:19.4209214Z 2025-05-07T19:45:19.4209224Z 2025-05-07T19:45:19.4209234Z 2025-05-07T19:45:19.4209245Z 2025-05-07T19:45:19.4210048Z harfbuzz-9.0.0 | 1.5 MB | | 0%  2025-05-07T19:45:19.4210912Z 2025-05-07T19:45:19.4210922Z 2025-05-07T19:45:19.4210933Z 2025-05-07T19:45:19.4210943Z 2025-05-07T19:45:19.4210955Z 2025-05-07T19:45:19.4210965Z 2025-05-07T19:45:19.4210975Z 2025-05-07T19:45:19.4210985Z 2025-05-07T19:45:19.4210995Z 2025-05-07T19:45:19.4211006Z 2025-05-07T19:45:19.4211016Z 2025-05-07T19:45:19.4211273Z 2025-05-07T19:45:19.4211286Z 2025-05-07T19:45:19.4215213Z libgfortran5-15.1.0 | 1.5 MB | | 0%  2025-05-07T19:45:19.4215539Z 2025-05-07T19:45:19.4215543Z 2025-05-07T19:45:19.4215562Z 2025-05-07T19:45:19.4215566Z 2025-05-07T19:45:19.4215573Z 2025-05-07T19:45:19.4215577Z 2025-05-07T19:45:19.4215580Z 2025-05-07T19:45:19.4215584Z 2025-05-07T19:45:19.4215587Z 2025-05-07T19:45:19.4215590Z 2025-05-07T19:45:19.4215614Z 2025-05-07T19:45:19.4215617Z 2025-05-07T19:45:19.4215621Z 2025-05-07T19:45:19.4217681Z 2025-05-07T19:45:19.4217955Z krb5-1.21.3 | 1.3 MB | | 0%  
2025-05-07T19:45:19.4218258Z 2025-05-07T19:45:19.4218262Z 2025-05-07T19:45:19.4218284Z 2025-05-07T19:45:19.4218287Z 2025-05-07T19:45:19.4218291Z 2025-05-07T19:45:19.4218294Z 2025-05-07T19:45:19.4218297Z 2025-05-07T19:45:19.4218301Z 2025-05-07T19:45:19.4218304Z 2025-05-07T19:45:19.4218308Z 2025-05-07T19:45:19.4218315Z 2025-05-07T19:45:19.4218319Z 2025-05-07T19:45:19.4218322Z 2025-05-07T19:45:19.4218326Z 2025-05-07T19:45:19.4218329Z 2025-05-07T19:45:19.4218642Z libabseil-20250127.1 | 1.3 MB | | 0%  2025-05-07T19:45:19.4219005Z 2025-05-07T19:45:19.4219079Z 2025-05-07T19:45:19.4219082Z 2025-05-07T19:45:19.4219086Z 2025-05-07T19:45:19.4219089Z 2025-05-07T19:45:19.4219092Z 2025-05-07T19:45:19.4219096Z 2025-05-07T19:45:19.4219099Z 2025-05-07T19:45:19.4219102Z 2025-05-07T19:45:19.4219106Z 2025-05-07T19:45:19.4219109Z 2025-05-07T19:45:19.4219112Z 2025-05-07T19:45:19.4219116Z 2025-05-07T19:45:19.4219119Z 2025-05-07T19:45:19.4219123Z 2025-05-07T19:45:19.4219126Z 2025-05-07T19:45:19.4219439Z cairo-1.18.0 | 961 KB | | 0%  2025-05-07T19:45:19.4219744Z 2025-05-07T19:45:19.4219747Z 2025-05-07T19:45:19.4219750Z 2025-05-07T19:45:19.4219754Z 2025-05-07T19:45:19.4219761Z 2025-05-07T19:45:19.4219765Z 2025-05-07T19:45:19.4219768Z 2025-05-07T19:45:19.4219771Z 2025-05-07T19:45:19.4219775Z 2025-05-07T19:45:19.4219778Z 2025-05-07T19:45:19.4219782Z 2025-05-07T19:45:19.4219785Z 2025-05-07T19:45:19.4219806Z 2025-05-07T19:45:19.4219809Z 2025-05-07T19:45:19.4219812Z 2025-05-07T19:45:19.4219820Z 2025-05-07T19:45:19.4219824Z 2025-05-07T19:45:19.4220146Z pcre2-10.44 | 934 KB | | 0%  2025-05-07T19:45:19.4220450Z 2025-05-07T19:45:19.4220454Z 2025-05-07T19:45:19.4220458Z 2025-05-07T19:45:19.4220477Z 2025-05-07T19:45:19.4220481Z 2025-05-07T19:45:19.4220484Z 2025-05-07T19:45:19.4220488Z 2025-05-07T19:45:19.4220491Z 2025-05-07T19:45:19.4220494Z 2025-05-07T19:45:19.4220509Z 2025-05-07T19:45:19.4220512Z 2025-05-07T19:45:19.4220516Z 2025-05-07T19:45:19.4220519Z 2025-05-07T19:45:19.4220523Z 
2025-05-07T19:45:19.4220526Z 2025-05-07T19:45:19.4220530Z 2025-05-07T19:45:19.4220533Z 2025-05-07T19:45:19.4221716Z 2025-05-07T19:45:19.4222073Z libsqlite-3.49.2 | 895 KB | | 0%  2025-05-07T19:45:19.4222403Z 2025-05-07T19:45:19.4222419Z 2025-05-07T19:45:19.4222423Z 2025-05-07T19:45:19.4222426Z 2025-05-07T19:45:19.4222430Z 2025-05-07T19:45:19.4222438Z 2025-05-07T19:45:19.4222442Z 2025-05-07T19:45:19.4222445Z 2025-05-07T19:45:19.4222448Z 2025-05-07T19:45:19.4222452Z 2025-05-07T19:45:19.4222455Z 2025-05-07T19:45:19.4222459Z 2025-05-07T19:45:19.4222462Z 2025-05-07T19:45:19.4222482Z 2025-05-07T19:45:19.4222486Z 2025-05-07T19:45:19.4222489Z 2025-05-07T19:45:19.4222493Z 2025-05-07T19:45:19.4222496Z 2025-05-07T19:45:19.4222500Z 2025-05-07T19:45:19.6293149Z ... (more hidden) ... 2025-05-07T19:45:19.6293663Z 2025-05-07T19:45:19.6293681Z 2025-05-07T19:45:19.6293685Z 2025-05-07T19:45:19.6562616Z cmake-4.0.2 | 19.4 MB | 1 | 2%  2025-05-07T19:45:19.6563142Z 2025-05-07T19:45:19.6563147Z 2025-05-07T19:45:19.6563151Z 2025-05-07T19:45:19.6563155Z 2025-05-07T19:45:19.7004268Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:19.7004596Z 2025-05-07T19:45:19.7102512Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:19.7102802Z 2025-05-07T19:45:19.7102807Z 2025-05-07T19:45:19.7293315Z python-3.11.11 | 29.2 MB | | 0%  2025-05-07T19:45:19.7294696Z 2025-05-07T19:45:19.7294736Z 2025-05-07T19:45:19.7294756Z 2025-05-07T19:45:19.7502564Z cmake-4.0.2 | 19.4 MB | ####2 | 43%  2025-05-07T19:45:19.7559276Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:19.7559569Z 2025-05-07T19:45:19.7559574Z 2025-05-07T19:45:19.7559578Z 2025-05-07T19:45:19.7560689Z 2025-05-07T19:45:19.8007067Z libgrpc-1.71.0 | 7.6 MB | ########9 | 90%  2025-05-07T19:45:19.8007384Z 2025-05-07T19:45:19.8104797Z bazel-7.5.0 | 47.4 MB | #3 | 13%  2025-05-07T19:45:19.8105077Z 2025-05-07T19:45:19.8105267Z 2025-05-07T19:45:19.8293942Z python-3.11.11 | 29.2 MB | ## | 20%  2025-05-07T19:45:19.8294234Z 2025-05-07T19:45:19.8294396Z 
2025-05-07T19:45:19.8294404Z 2025-05-07T19:45:19.8503700Z cmake-4.0.2 | 19.4 MB | #######4 | 74%  2025-05-07T19:45:19.8921246Z openjdk-23.0.1 | 181.3 MB | 3 | 3% 2025-05-07T19:45:19.8921566Z 2025-05-07T19:45:19.8921572Z 2025-05-07T19:45:19.8921577Z 2025-05-07T19:45:19.8921581Z 2025-05-07T19:45:19.9007198Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:19.9007517Z 2025-05-07T19:45:19.9105913Z bazel-7.5.0 | 47.4 MB | ##4 | 25%  2025-05-07T19:45:19.9106679Z 2025-05-07T19:45:19.9106709Z 2025-05-07T19:45:19.9459597Z python-3.11.11 | 29.2 MB | ####1 | 41%  2025-05-07T19:45:19.9459903Z 2025-05-07T19:45:19.9460036Z 2025-05-07T19:45:19.9460061Z 2025-05-07T19:45:19.9460066Z 2025-05-07T19:45:19.9460098Z 2025-05-07T19:45:19.9502689Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:20.0008430Z openjdk-23.0.1 | 181.3 MB | 6 | 6% 2025-05-07T19:45:20.0008752Z 2025-05-07T19:45:20.0106612Z bazel-7.5.0 | 47.4 MB | ###5 | 35%  2025-05-07T19:45:20.0107413Z 2025-05-07T19:45:20.0107457Z 2025-05-07T19:45:20.0459612Z python-3.11.11 | 29.2 MB | #####9 | 60%  2025-05-07T19:45:20.0459906Z 2025-05-07T19:45:20.0459910Z 2025-05-07T19:45:20.0459913Z 2025-05-07T19:45:20.0459917Z 2025-05-07T19:45:20.0459921Z 2025-05-07T19:45:20.0504449Z openblas-0.3.29 | 5.8 MB | ########7 | 88%  2025-05-07T19:45:20.1010956Z openjdk-23.0.1 | 181.3 MB | 9 | 9% 2025-05-07T19:45:20.1011304Z 2025-05-07T19:45:20.1315262Z bazel-7.5.0 | 47.4 MB | ####9 | 49%  2025-05-07T19:45:20.1315558Z 2025-05-07T19:45:20.1315562Z 2025-05-07T19:45:20.1315585Z 2025-05-07T19:45:20.1315590Z 2025-05-07T19:45:20.1315593Z 2025-05-07T19:45:20.1506433Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:20.1786657Z openjdk-23.0.1 | 181.3 MB | #3 | 14% 2025-05-07T19:45:20.1786976Z 2025-05-07T19:45:20.1787001Z 2025-05-07T19:45:20.1787007Z 2025-05-07T19:45:20.1787011Z 2025-05-07T19:45:20.1787015Z 2025-05-07T19:45:20.1787020Z 2025-05-07T19:45:20.2012632Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:20.2012970Z 
2025-05-07T19:45:20.2147998Z bazel-7.5.0 | 47.4 MB | ######4 | 65%  2025-05-07T19:45:20.2148779Z 2025-05-07T19:45:20.2148784Z 2025-05-07T19:45:20.2454046Z 2025-05-07T19:45:20.2455011Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:20.2455836Z 2025-05-07T19:45:20.2455848Z 2025-05-07T19:45:20.2455859Z 2025-05-07T19:45:20.2455869Z 2025-05-07T19:45:20.2455880Z 2025-05-07T19:45:20.2455890Z 2025-05-07T19:45:20.2456322Z 2025-05-07T19:45:20.2488303Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:20.2488613Z 2025-05-07T19:45:20.2491863Z 2025-05-07T19:45:20.2572218Z python-3.11.11 | 29.2 MB | #######7 | 77%  2025-05-07T19:45:20.2789105Z openjdk-23.0.1 | 181.3 MB | #6 | 17% 2025-05-07T19:45:20.2789634Z 2025-05-07T19:45:20.2789687Z 2025-05-07T19:45:20.2789693Z 2025-05-07T19:45:20.2789697Z 2025-05-07T19:45:20.2789700Z 2025-05-07T19:45:20.2789733Z 2025-05-07T19:45:20.3457025Z libopenblas-0.3.29 | 5.6 MB | ########4 | 84%  2025-05-07T19:45:20.3457954Z 2025-05-07T19:45:20.3457968Z 2025-05-07T19:45:20.3457979Z 2025-05-07T19:45:20.3457989Z 2025-05-07T19:45:20.3457999Z 2025-05-07T19:45:20.3458009Z 2025-05-07T19:45:20.3458019Z 2025-05-07T19:45:20.3488059Z libcups-2.3.3 | 4.3 MB | #########7 | 98%  2025-05-07T19:45:20.3488904Z 2025-05-07T19:45:20.3488916Z 2025-05-07T19:45:20.3571023Z python-3.11.11 | 29.2 MB | #########2 | 92%  2025-05-07T19:45:20.3571882Z 2025-05-07T19:45:20.3975447Z bazel-7.5.0 | 47.4 MB | #######7 | 78%  2025-05-07T19:45:20.4216094Z openjdk-23.0.1 | 181.3 MB | ## | 20% 2025-05-07T19:45:20.4217309Z 2025-05-07T19:45:20.4217348Z 2025-05-07T19:45:20.4217359Z 2025-05-07T19:45:20.4217370Z 2025-05-07T19:45:20.4217381Z 2025-05-07T19:45:20.4217392Z 2025-05-07T19:45:20.4217402Z 2025-05-07T19:45:20.4234890Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:20.4235200Z 2025-05-07T19:45:20.4235204Z 2025-05-07T19:45:20.4235208Z 2025-05-07T19:45:20.4235211Z 2025-05-07T19:45:20.4235215Z 2025-05-07T19:45:20.4235226Z 2025-05-07T19:45:20.4561326Z 
libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:20.4561959Z 2025-05-07T19:45:20.4561964Z 2025-05-07T19:45:20.4561967Z 2025-05-07T19:45:20.4561971Z 2025-05-07T19:45:20.4561988Z 2025-05-07T19:45:20.4561991Z 2025-05-07T19:45:20.4561995Z 2025-05-07T19:45:20.4561998Z 2025-05-07T19:45:20.4601549Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:20.4602479Z 2025-05-07T19:45:20.4902320Z bazel-7.5.0 | 47.4 MB | #########3 | 93%  2025-05-07T19:45:20.4902628Z 2025-05-07T19:45:20.4902778Z 2025-05-07T19:45:20.4902936Z 2025-05-07T19:45:20.4902947Z 2025-05-07T19:45:20.4902951Z 2025-05-07T19:45:20.4902955Z 2025-05-07T19:45:20.4902959Z 2025-05-07T19:45:20.4902963Z 2025-05-07T19:45:20.4902995Z 2025-05-07T19:45:20.5265273Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:20.5721599Z openjdk-23.0.1 | 181.3 MB | ##2 | 23% 2025-05-07T19:45:20.5721945Z 2025-05-07T19:45:20.5721951Z 2025-05-07T19:45:20.5721955Z 2025-05-07T19:45:20.5721958Z 2025-05-07T19:45:20.5721962Z 2025-05-07T19:45:20.5721967Z 2025-05-07T19:45:20.5721971Z 2025-05-07T19:45:20.5721975Z 2025-05-07T19:45:20.5722547Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:20.5722887Z 2025-05-07T19:45:20.5722892Z 2025-05-07T19:45:20.5722896Z 2025-05-07T19:45:20.5722900Z 2025-05-07T19:45:20.5722903Z 2025-05-07T19:45:20.5722907Z 2025-05-07T19:45:20.5722917Z 2025-05-07T19:45:20.5722935Z 2025-05-07T19:45:20.5867320Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:20.5867674Z 2025-05-07T19:45:20.5867679Z 2025-05-07T19:45:20.5867682Z 2025-05-07T19:45:20.5867686Z 2025-05-07T19:45:20.5867689Z 2025-05-07T19:45:20.5867693Z 2025-05-07T19:45:20.5867696Z 2025-05-07T19:45:20.5867700Z 2025-05-07T19:45:20.5867704Z 2025-05-07T19:45:20.6227659Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:20.6228633Z 2025-05-07T19:45:20.6228676Z 2025-05-07T19:45:20.6228689Z 2025-05-07T19:45:20.6228730Z 2025-05-07T19:45:20.6228741Z 2025-05-07T19:45:20.6228751Z 2025-05-07T19:45:20.6229180Z 
2025-05-07T19:45:20.6229194Z 2025-05-07T19:45:20.6229204Z 2025-05-07T19:45:20.6229214Z 2025-05-07T19:45:20.6281221Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:20.6281651Z 2025-05-07T19:45:20.6281923Z 2025-05-07T19:45:20.6281947Z 2025-05-07T19:45:20.6281954Z 2025-05-07T19:45:20.6281959Z 2025-05-07T19:45:20.6281964Z 2025-05-07T19:45:20.6281969Z 2025-05-07T19:45:20.6281974Z 2025-05-07T19:45:20.6282016Z 2025-05-07T19:45:20.6282021Z 2025-05-07T19:45:20.6283703Z 2025-05-07T19:45:20.6454806Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:20.6858450Z openjdk-23.0.1 | 181.3 MB | ##7 | 27% 2025-05-07T19:45:20.6858927Z 2025-05-07T19:45:20.6858973Z 2025-05-07T19:45:20.6858980Z 2025-05-07T19:45:20.6859005Z 2025-05-07T19:45:20.6859010Z 2025-05-07T19:45:20.6859049Z 2025-05-07T19:45:20.6859053Z 2025-05-07T19:45:20.6859057Z 2025-05-07T19:45:20.6859133Z 2025-05-07T19:45:20.6859150Z 2025-05-07T19:45:20.6859183Z 2025-05-07T19:45:20.7016770Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:20.7017774Z 2025-05-07T19:45:20.7017788Z 2025-05-07T19:45:20.7017799Z 2025-05-07T19:45:20.7017809Z 2025-05-07T19:45:20.7101469Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:20.7102244Z 2025-05-07T19:45:20.7102249Z 2025-05-07T19:45:20.7102253Z 2025-05-07T19:45:20.7102256Z 2025-05-07T19:45:20.7102260Z 2025-05-07T19:45:20.7102263Z 2025-05-07T19:45:20.7102267Z 2025-05-07T19:45:20.7102270Z 2025-05-07T19:45:20.7102274Z 2025-05-07T19:45:20.7102277Z 2025-05-07T19:45:20.7350225Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:20.7350546Z 2025-05-07T19:45:20.7350550Z 2025-05-07T19:45:20.7350554Z 2025-05-07T19:45:20.7350558Z 2025-05-07T19:45:20.7350561Z 2025-05-07T19:45:20.7350565Z 2025-05-07T19:45:20.7350593Z 2025-05-07T19:45:20.7350610Z 2025-05-07T19:45:20.7350614Z 2025-05-07T19:45:20.7350617Z 2025-05-07T19:45:20.7350620Z 2025-05-07T19:45:20.7351098Z 2025-05-07T19:45:20.7487801Z harfbuzz-9.0.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:20.7488783Z 
2025-05-07T19:45:20.7488829Z 2025-05-07T19:45:20.7488840Z 2025-05-07T19:45:20.7488851Z 2025-05-07T19:45:20.7488861Z 2025-05-07T19:45:20.7488872Z 2025-05-07T19:45:20.7488882Z 2025-05-07T19:45:20.7488921Z 2025-05-07T19:45:20.7488931Z 2025-05-07T19:45:20.7488942Z 2025-05-07T19:45:20.7488952Z 2025-05-07T19:45:20.7488962Z 2025-05-07T19:45:20.7492103Z 2025-05-07T19:45:20.7526279Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:20.7527296Z 2025-05-07T19:45:20.7527356Z 2025-05-07T19:45:20.7597017Z python-3.11.11 | 29.2 MB | ########## | 100%  2025-05-07T19:45:20.7816326Z openjdk-23.0.1 | 181.3 MB | ### | 30% 2025-05-07T19:45:20.7817174Z 2025-05-07T19:45:20.7817191Z 2025-05-07T19:45:20.7817203Z 2025-05-07T19:45:20.7817214Z 2025-05-07T19:45:20.7817224Z 2025-05-07T19:45:20.7817235Z 2025-05-07T19:45:20.7817245Z 2025-05-07T19:45:20.7817255Z 2025-05-07T19:45:20.7817266Z 2025-05-07T19:45:20.7817276Z 2025-05-07T19:45:20.7817286Z 2025-05-07T19:45:20.7817309Z 2025-05-07T19:45:20.7817319Z 2025-05-07T19:45:20.7817329Z 2025-05-07T19:45:20.7880660Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:20.7880982Z 2025-05-07T19:45:20.7880987Z 2025-05-07T19:45:20.7880991Z 2025-05-07T19:45:20.7880995Z 2025-05-07T19:45:20.7880998Z 2025-05-07T19:45:20.7881002Z 2025-05-07T19:45:20.7881005Z 2025-05-07T19:45:20.7881009Z 2025-05-07T19:45:20.7881012Z 2025-05-07T19:45:20.7881016Z 2025-05-07T19:45:20.7881019Z 2025-05-07T19:45:20.7881174Z 2025-05-07T19:45:20.7995253Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:20.7996649Z 2025-05-07T19:45:20.7996702Z 2025-05-07T19:45:20.7996714Z 2025-05-07T19:45:20.7996724Z 2025-05-07T19:45:20.7996734Z 2025-05-07T19:45:20.7996744Z 2025-05-07T19:45:20.7996784Z 2025-05-07T19:45:20.7996795Z 2025-05-07T19:45:20.7996806Z 2025-05-07T19:45:20.7996816Z 2025-05-07T19:45:20.7996826Z 2025-05-07T19:45:20.7996853Z 2025-05-07T19:45:20.7996864Z 2025-05-07T19:45:20.8177132Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:20.8178191Z 
2025-05-07T19:45:20.8178234Z 2025-05-07T19:45:20.8178245Z 2025-05-07T19:45:20.8178256Z 2025-05-07T19:45:20.8178266Z 2025-05-07T19:45:20.8178277Z 2025-05-07T19:45:20.8178288Z 2025-05-07T19:45:20.8178299Z 2025-05-07T19:45:20.8178309Z 2025-05-07T19:45:20.8178319Z 2025-05-07T19:45:20.8178329Z 2025-05-07T19:45:20.8178339Z 2025-05-07T19:45:20.8178350Z 2025-05-07T19:45:20.8178360Z 2025-05-07T19:45:20.8309978Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:20.8310373Z 2025-05-07T19:45:20.8310377Z 2025-05-07T19:45:20.8310381Z 2025-05-07T19:45:20.8310385Z 2025-05-07T19:45:20.8310388Z 2025-05-07T19:45:20.8310392Z 2025-05-07T19:45:20.8310395Z 2025-05-07T19:45:20.8310451Z 2025-05-07T19:45:20.8310454Z 2025-05-07T19:45:20.8310460Z 2025-05-07T19:45:20.8310666Z 2025-05-07T19:45:20.8310681Z 2025-05-07T19:45:20.8310684Z 2025-05-07T19:45:20.8310688Z 2025-05-07T19:45:20.8310691Z 2025-05-07T19:45:20.8312816Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:20.8313188Z 2025-05-07T19:45:20.8313192Z 2025-05-07T19:45:20.8313196Z 2025-05-07T19:45:20.8313199Z 2025-05-07T19:45:20.8313202Z 2025-05-07T19:45:20.8313206Z 2025-05-07T19:45:20.8313209Z 2025-05-07T19:45:20.8313213Z 2025-05-07T19:45:20.8313216Z 2025-05-07T19:45:20.8313243Z 2025-05-07T19:45:20.8313246Z 2025-05-07T19:45:20.8313250Z 2025-05-07T19:45:20.8313253Z 2025-05-07T19:45:20.8313257Z 2025-05-07T19:45:20.8313266Z 2025-05-07T19:45:20.8313270Z 2025-05-07T19:45:20.8547443Z cairo-1.18.0 | 961 KB | 1 | 2%  2025-05-07T19:45:20.8548419Z 2025-05-07T19:45:20.8548434Z 2025-05-07T19:45:20.8548445Z 2025-05-07T19:45:20.8548454Z 2025-05-07T19:45:20.8548494Z 2025-05-07T19:45:20.8548505Z 2025-05-07T19:45:20.8548516Z 2025-05-07T19:45:20.8548526Z 2025-05-07T19:45:20.8548537Z 2025-05-07T19:45:20.8548547Z 2025-05-07T19:45:20.8548558Z 2025-05-07T19:45:20.8548568Z 2025-05-07T19:45:20.8548578Z 2025-05-07T19:45:20.8548588Z 2025-05-07T19:45:20.8548598Z 2025-05-07T19:45:20.8548609Z 2025-05-07T19:45:20.8548619Z 2025-05-07T19:45:20.8599800Z 
pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:20.8633399Z openjdk-23.0.1 | 181.3 MB | ###4 | 34% 2025-05-07T19:45:20.8634183Z 2025-05-07T19:45:20.8634222Z 2025-05-07T19:45:20.8634234Z 2025-05-07T19:45:20.8634244Z 2025-05-07T19:45:20.8634285Z 2025-05-07T19:45:20.8644407Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:20.8644713Z 2025-05-07T19:45:20.8644717Z 2025-05-07T19:45:20.8644720Z 2025-05-07T19:45:20.8644724Z 2025-05-07T19:45:20.8644752Z 2025-05-07T19:45:20.8644756Z 2025-05-07T19:45:20.8644765Z 2025-05-07T19:45:20.8644768Z 2025-05-07T19:45:20.8644772Z 2025-05-07T19:45:20.8644775Z 2025-05-07T19:45:20.8644779Z 2025-05-07T19:45:20.8644782Z 2025-05-07T19:45:20.8644785Z 2025-05-07T19:45:20.8644789Z 2025-05-07T19:45:20.8644792Z 2025-05-07T19:45:20.8646553Z 2025-05-07T19:45:20.8794272Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:20.8794628Z 2025-05-07T19:45:20.8794632Z 2025-05-07T19:45:20.8794636Z 2025-05-07T19:45:20.8794665Z 2025-05-07T19:45:20.8794668Z 2025-05-07T19:45:20.8794672Z 2025-05-07T19:45:20.8794675Z 2025-05-07T19:45:20.8794679Z 2025-05-07T19:45:20.8794682Z 2025-05-07T19:45:20.8794867Z 2025-05-07T19:45:20.8794872Z 2025-05-07T19:45:20.8794875Z 2025-05-07T19:45:20.8794879Z 2025-05-07T19:45:20.8794882Z 2025-05-07T19:45:20.8794886Z 2025-05-07T19:45:20.8851164Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:20.8852303Z 2025-05-07T19:45:20.8852351Z 2025-05-07T19:45:20.8852362Z 2025-05-07T19:45:20.8852373Z 2025-05-07T19:45:20.8852383Z 2025-05-07T19:45:20.8852394Z 2025-05-07T19:45:20.8852404Z 2025-05-07T19:45:20.8852414Z 2025-05-07T19:45:20.8852424Z 2025-05-07T19:45:20.8852435Z 2025-05-07T19:45:20.8852445Z 2025-05-07T19:45:20.8852455Z 2025-05-07T19:45:20.8852466Z 2025-05-07T19:45:20.8852476Z 2025-05-07T19:45:20.8852487Z 2025-05-07T19:45:20.8852497Z 2025-05-07T19:45:20.8852507Z 2025-05-07T19:45:20.8993865Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:20.8994881Z 2025-05-07T19:45:20.8994897Z 
2025-05-07T19:45:20.8994908Z 2025-05-07T19:45:20.8994949Z 2025-05-07T19:45:20.8994960Z 2025-05-07T19:45:20.8994970Z 2025-05-07T19:45:20.8994980Z 2025-05-07T19:45:20.9037270Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:20.9038194Z 2025-05-07T19:45:20.9038208Z 2025-05-07T19:45:20.9038606Z 2025-05-07T19:45:20.9038617Z 2025-05-07T19:45:20.9038627Z 2025-05-07T19:45:20.9038637Z 2025-05-07T19:45:20.9038647Z 2025-05-07T19:45:20.9038658Z 2025-05-07T19:45:20.9038668Z 2025-05-07T19:45:20.9038708Z 2025-05-07T19:45:20.9038718Z 2025-05-07T19:45:20.9038728Z 2025-05-07T19:45:20.9038738Z 2025-05-07T19:45:20.9038748Z 2025-05-07T19:45:20.9038759Z 2025-05-07T19:45:20.9038769Z 2025-05-07T19:45:20.9038778Z 2025-05-07T19:45:20.9038788Z 2025-05-07T19:45:20.9277287Z libsqlite-3.49.2 | 895 KB | 1 | 2%  2025-05-07T19:45:20.9278390Z 2025-05-07T19:45:20.9278403Z 2025-05-07T19:45:20.9278414Z 2025-05-07T19:45:20.9278424Z 2025-05-07T19:45:20.9278465Z 2025-05-07T19:45:20.9278477Z 2025-05-07T19:45:20.9278487Z 2025-05-07T19:45:20.9278498Z 2025-05-07T19:45:20.9278508Z 2025-05-07T19:45:20.9278518Z 2025-05-07T19:45:20.9278529Z 2025-05-07T19:45:20.9278539Z 2025-05-07T19:45:20.9278550Z 2025-05-07T19:45:20.9278560Z 2025-05-07T19:45:20.9278584Z 2025-05-07T19:45:20.9278595Z 2025-05-07T19:45:20.9278605Z 2025-05-07T19:45:20.9278614Z 2025-05-07T19:45:20.9305013Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:20.9305436Z 2025-05-07T19:45:20.9305441Z 2025-05-07T19:45:20.9305445Z 2025-05-07T19:45:20.9305448Z 2025-05-07T19:45:20.9305452Z 2025-05-07T19:45:20.9305455Z 2025-05-07T19:45:20.9305458Z 2025-05-07T19:45:20.9305462Z 2025-05-07T19:45:20.9305465Z 2025-05-07T19:45:20.9305495Z 2025-05-07T19:45:20.9305499Z 2025-05-07T19:45:20.9305502Z 2025-05-07T19:45:20.9305505Z 2025-05-07T19:45:20.9305509Z 2025-05-07T19:45:20.9305512Z 2025-05-07T19:45:20.9305530Z 2025-05-07T19:45:20.9305534Z 2025-05-07T19:45:20.9305537Z 2025-05-07T19:45:20.9305541Z 2025-05-07T19:45:20.9602727Z ... 
(more hidden) ... 2025-05-07T19:45:20.9634065Z openjdk-23.0.1 | 181.3 MB | ###7 | 38% 2025-05-07T19:45:20.9634376Z 2025-05-07T19:45:20.9634381Z 2025-05-07T19:45:20.9634384Z 2025-05-07T19:45:20.9634388Z 2025-05-07T19:45:20.9634391Z 2025-05-07T19:45:20.9634395Z 2025-05-07T19:45:20.9634398Z 2025-05-07T19:45:20.9634427Z 2025-05-07T19:45:20.9634431Z 2025-05-07T19:45:20.9634434Z 2025-05-07T19:45:20.9634438Z 2025-05-07T19:45:20.9634441Z 2025-05-07T19:45:20.9634445Z 2025-05-07T19:45:20.9634448Z 2025-05-07T19:45:20.9634451Z 2025-05-07T19:45:20.9634455Z 2025-05-07T19:45:20.9634458Z 2025-05-07T19:45:20.9634462Z 2025-05-07T19:45:20.9634465Z 2025-05-07T19:45:21.0604908Z ... (more hidden) ... 2025-05-07T19:45:21.1097571Z openjdk-23.0.1 | 181.3 MB | ####1 | 41% 2025-05-07T19:45:21.1098387Z 2025-05-07T19:45:21.1605719Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:21.1872417Z openjdk-23.0.1 | 181.3 MB | ####4 | 45% 2025-05-07T19:45:21.1873118Z 2025-05-07T19:45:21.1873143Z 2025-05-07T19:45:21.1873167Z 2025-05-07T19:45:21.1873191Z 2025-05-07T19:45:21.1873195Z 2025-05-07T19:45:21.1873198Z 2025-05-07T19:45:21.2606666Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:21.2711174Z openjdk-23.0.1 | 181.3 MB | ####9 | 50% 2025-05-07T19:45:21.2711993Z 2025-05-07T19:45:21.2712007Z 2025-05-07T19:45:21.2712018Z 2025-05-07T19:45:21.2712028Z 2025-05-07T19:45:21.2712039Z 2025-05-07T19:45:21.2712049Z 2025-05-07T19:45:21.2712059Z 2025-05-07T19:45:21.2712069Z 2025-05-07T19:45:21.3556719Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:21.3557630Z 2025-05-07T19:45:21.3557644Z 2025-05-07T19:45:21.3557684Z 2025-05-07T19:45:21.3557725Z 2025-05-07T19:45:21.3557736Z 2025-05-07T19:45:21.3557746Z 2025-05-07T19:45:21.3557757Z 2025-05-07T19:45:21.3557767Z 2025-05-07T19:45:21.3557778Z 2025-05-07T19:45:21.3557788Z 2025-05-07T19:45:21.3557798Z 2025-05-07T19:45:21.3559187Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:21.3560649Z 
2025-05-07T19:45:21.3560661Z 2025-05-07T19:45:21.3560671Z 2025-05-07T19:45:21.3560681Z 2025-05-07T19:45:21.3560691Z 2025-05-07T19:45:21.3560701Z 2025-05-07T19:45:21.3560712Z 2025-05-07T19:45:21.3560722Z 2025-05-07T19:45:21.3560732Z 2025-05-07T19:45:21.3560742Z 2025-05-07T19:45:21.3560779Z 2025-05-07T19:45:21.3929739Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:21.5027385Z openjdk-23.0.1 | 181.3 MB | #####3 | 54% 2025-05-07T19:45:21.6362651Z openjdk-23.0.1 | 181.3 MB | #####7 | 58% 2025-05-07T19:45:21.6428297Z openjdk-23.0.1 | 181.3 MB | ######1 | 61% 2025-05-07T19:45:21.6428920Z 2025-05-07T19:45:21.6428925Z 2025-05-07T19:45:21.6428929Z 2025-05-07T19:45:21.6428952Z 2025-05-07T19:45:21.6428956Z 2025-05-07T19:45:21.6428959Z 2025-05-07T19:45:21.6428963Z 2025-05-07T19:45:21.6428966Z 2025-05-07T19:45:21.6429129Z 2025-05-07T19:45:21.6435605Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:21.6435990Z 2025-05-07T19:45:21.6435995Z 2025-05-07T19:45:21.6435999Z 2025-05-07T19:45:21.6436003Z 2025-05-07T19:45:21.6436007Z 2025-05-07T19:45:21.6436010Z 2025-05-07T19:45:21.6436014Z 2025-05-07T19:45:21.6436017Z 2025-05-07T19:45:21.6436021Z 2025-05-07T19:45:21.7596227Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:21.7597246Z 2025-05-07T19:45:21.7597259Z 2025-05-07T19:45:21.7597270Z 2025-05-07T19:45:21.7597281Z 2025-05-07T19:45:21.7597291Z 2025-05-07T19:45:21.7597301Z 2025-05-07T19:45:21.7597311Z 2025-05-07T19:45:21.7597351Z 2025-05-07T19:45:21.7597362Z 2025-05-07T19:45:21.7597373Z 2025-05-07T19:45:21.7598099Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:21.7598880Z 2025-05-07T19:45:21.7598891Z 2025-05-07T19:45:21.7598901Z 2025-05-07T19:45:21.7598912Z 2025-05-07T19:45:21.7598940Z 2025-05-07T19:45:21.7598950Z 2025-05-07T19:45:21.7598960Z 2025-05-07T19:45:21.7598970Z 2025-05-07T19:45:21.7598980Z 2025-05-07T19:45:21.7598991Z 2025-05-07T19:45:21.8011429Z tk-8.6.13 | 3.2 MB | ########## | 100%  
2025-05-07T19:45:21.9011878Z openjdk-23.0.1 | 181.3 MB | ######4 | 64% 2025-05-07T19:45:21.9146400Z openjdk-23.0.1 | 181.3 MB | ######8 | 68% 2025-05-07T19:45:21.9147249Z 2025-05-07T19:45:21.9147263Z 2025-05-07T19:45:21.9147274Z 2025-05-07T19:45:21.9147284Z 2025-05-07T19:45:21.9147295Z 2025-05-07T19:45:21.9147306Z 2025-05-07T19:45:21.9147342Z 2025-05-07T19:45:21.9147353Z 2025-05-07T19:45:21.9149108Z 2025-05-07T19:45:21.9149117Z 2025-05-07T19:45:21.9149120Z 2025-05-07T19:45:21.9149124Z 2025-05-07T19:45:21.9149553Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:21.9149869Z 2025-05-07T19:45:21.9149873Z 2025-05-07T19:45:21.9149894Z 2025-05-07T19:45:21.9149912Z 2025-05-07T19:45:21.9149915Z 2025-05-07T19:45:21.9149919Z 2025-05-07T19:45:21.9149923Z 2025-05-07T19:45:21.9149926Z 2025-05-07T19:45:21.9149929Z 2025-05-07T19:45:21.9149933Z 2025-05-07T19:45:21.9149936Z 2025-05-07T19:45:21.9149940Z 2025-05-07T19:45:22.0012735Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:22.0123604Z openjdk-23.0.1 | 181.3 MB | #######4 | 75% 2025-05-07T19:45:22.0124395Z 2025-05-07T19:45:22.0124409Z 2025-05-07T19:45:22.0124420Z 2025-05-07T19:45:22.0124431Z 2025-05-07T19:45:22.0124442Z 2025-05-07T19:45:22.0124452Z 2025-05-07T19:45:22.0124463Z 2025-05-07T19:45:22.0124473Z 2025-05-07T19:45:22.0124540Z 2025-05-07T19:45:22.0124552Z 2025-05-07T19:45:22.0124562Z 2025-05-07T19:45:22.0124573Z 2025-05-07T19:45:22.0124601Z 2025-05-07T19:45:22.0125824Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:22.0126787Z 2025-05-07T19:45:22.0127207Z 2025-05-07T19:45:22.0127273Z 2025-05-07T19:45:22.0127284Z 2025-05-07T19:45:22.0127294Z 2025-05-07T19:45:22.0127304Z 2025-05-07T19:45:22.0127315Z 2025-05-07T19:45:22.0127324Z 2025-05-07T19:45:22.0127334Z 2025-05-07T19:45:22.0127344Z 2025-05-07T19:45:22.0127354Z 2025-05-07T19:45:22.0127365Z 2025-05-07T19:45:22.0127375Z 2025-05-07T19:45:22.1312335Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:22.1312866Z 
2025-05-07T19:45:22.1312871Z 2025-05-07T19:45:22.1312874Z 2025-05-07T19:45:22.1312878Z 2025-05-07T19:45:22.1312881Z 2025-05-07T19:45:22.1312885Z 2025-05-07T19:45:22.1312888Z 2025-05-07T19:45:22.1312892Z 2025-05-07T19:45:22.1312914Z 2025-05-07T19:45:22.1312917Z 2025-05-07T19:45:22.1312921Z 2025-05-07T19:45:22.1312924Z 2025-05-07T19:45:22.1312928Z 2025-05-07T19:45:22.1312931Z 2025-05-07T19:45:22.1313222Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.1313549Z 2025-05-07T19:45:22.1313553Z 2025-05-07T19:45:22.1313574Z 2025-05-07T19:45:22.1313578Z 2025-05-07T19:45:22.1313581Z 2025-05-07T19:45:22.1313585Z 2025-05-07T19:45:22.1313588Z 2025-05-07T19:45:22.1313592Z 2025-05-07T19:45:22.1313596Z 2025-05-07T19:45:22.1313599Z 2025-05-07T19:45:22.1313602Z 2025-05-07T19:45:22.1313606Z 2025-05-07T19:45:22.1313609Z 2025-05-07T19:45:22.1313613Z 2025-05-07T19:45:22.1325214Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.1875644Z openjdk-23.0.1 | 181.3 MB | #######9 | 80% 2025-05-07T19:45:22.1876517Z 2025-05-07T19:45:22.1876530Z 2025-05-07T19:45:22.1876541Z 2025-05-07T19:45:22.1876583Z 2025-05-07T19:45:22.1876595Z 2025-05-07T19:45:22.1876605Z 2025-05-07T19:45:22.1876615Z 2025-05-07T19:45:22.1876626Z 2025-05-07T19:45:22.1876636Z 2025-05-07T19:45:22.1876646Z 2025-05-07T19:45:22.1876656Z 2025-05-07T19:45:22.1876666Z 2025-05-07T19:45:22.1876676Z 2025-05-07T19:45:22.1876701Z 2025-05-07T19:45:22.1876711Z 2025-05-07T19:45:22.1876721Z 2025-05-07T19:45:22.1877760Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:22.1878678Z 2025-05-07T19:45:22.1878689Z 2025-05-07T19:45:22.1878700Z 2025-05-07T19:45:22.1878710Z 2025-05-07T19:45:22.1878720Z 2025-05-07T19:45:22.1878730Z 2025-05-07T19:45:22.1878740Z 2025-05-07T19:45:22.1878750Z 2025-05-07T19:45:22.1878760Z 2025-05-07T19:45:22.1878770Z 2025-05-07T19:45:22.1878780Z 2025-05-07T19:45:22.1878818Z 2025-05-07T19:45:22.1878828Z 2025-05-07T19:45:22.1878838Z 2025-05-07T19:45:22.1878848Z 2025-05-07T19:45:22.1878859Z 
2025-05-07T19:45:22.2330522Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:22.3330469Z openjdk-23.0.1 | 181.3 MB | ########6 | 86% 2025-05-07T19:45:22.4575507Z openjdk-23.0.1 | 181.3 MB | #########2 | 93% 2025-05-07T19:45:22.6785102Z openjdk-23.0.1 | 181.3 MB | #########8 | 98% 2025-05-07T19:45:22.6786379Z 2025-05-07T19:45:22.6786393Z 2025-05-07T19:45:22.6786404Z 2025-05-07T19:45:22.6786414Z 2025-05-07T19:45:22.6786424Z 2025-05-07T19:45:22.6786435Z 2025-05-07T19:45:22.6786445Z 2025-05-07T19:45:22.6786455Z 2025-05-07T19:45:22.6786465Z 2025-05-07T19:45:22.6786475Z 2025-05-07T19:45:22.6786485Z 2025-05-07T19:45:22.6786496Z 2025-05-07T19:45:22.6786506Z 2025-05-07T19:45:22.6786516Z 2025-05-07T19:45:22.6786548Z 2025-05-07T19:45:22.6787670Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.6788689Z 2025-05-07T19:45:22.6788700Z 2025-05-07T19:45:22.6788710Z 2025-05-07T19:45:22.6788736Z 2025-05-07T19:45:22.6788747Z 2025-05-07T19:45:22.6788757Z 2025-05-07T19:45:22.6788767Z 2025-05-07T19:45:22.6788777Z 2025-05-07T19:45:22.6788788Z 2025-05-07T19:45:22.6788820Z 2025-05-07T19:45:22.6788830Z 2025-05-07T19:45:22.6788840Z 2025-05-07T19:45:22.6788851Z 2025-05-07T19:45:22.6789274Z 2025-05-07T19:45:22.6789284Z 2025-05-07T19:45:22.8585292Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:22.8585926Z 2025-05-07T19:45:22.8585930Z 2025-05-07T19:45:22.8585934Z 2025-05-07T19:45:22.8585938Z 2025-05-07T19:45:22.8585942Z 2025-05-07T19:45:22.8586120Z 2025-05-07T19:45:22.8586124Z 2025-05-07T19:45:22.8586128Z 2025-05-07T19:45:22.8586131Z 2025-05-07T19:45:22.8586135Z 2025-05-07T19:45:22.8586138Z 2025-05-07T19:45:22.8586142Z 2025-05-07T19:45:22.8586145Z 2025-05-07T19:45:22.8586149Z 2025-05-07T19:45:22.8586152Z 2025-05-07T19:45:22.8586156Z 2025-05-07T19:45:22.8586159Z 2025-05-07T19:45:22.8586541Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:22.8586881Z 2025-05-07T19:45:22.8586885Z 2025-05-07T19:45:22.8586889Z 
2025-05-07T19:45:22.8847103Z pcre2-10.44 | 934 KB | ########## | 100% 2025-05-07T19:45:22.8849374Z libsqlite-3.49.2 | 895 KB | ########## | 100% 2025-05-07T19:45:23.4322725Z libsqlite-3.49.2 | 895 KB | ########## | 100% 2025-05-07T19:45:23.8048924Z cmake-4.0.2 | 19.4 MB | ########## | 100% 2025-05-07T19:45:24.2152056Z python-3.11.11 | 29.2 MB | ########## | 100%
2025-05-07T19:45:24.2153089Z ... (more hidden) ... 2025-05-07T19:45:24.3355222Z ... (more hidden) ...
2025-05-07T19:45:25.0086317Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:25.8550277Z bazel-7.5.0 | 47.4 MB | ########## | 100% 2025-05-07T19:45:25.8554396Z openjdk-23.0.1 | 181.3 MB | ########## | 100%
2025-05-07T19:45:25.8612436Z done 2025-05-07T19:45:26.1703123Z Preparing transaction: done 2025-05-07T19:45:29.8627185Z Verifying transaction: done 2025-05-07T19:45:32.5913085Z Executing transaction: done 2025-05-07T19:45:33.0145770Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:34.8566008Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:34.8567753Z 2025-05-07T19:45:34.8580899Z 2025-05-07T19:45:34.8604462Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:37.2088423Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable.
It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:37.2090042Z 2025-05-07T19:45:37.2090161Z Collecting build 2025-05-07T19:45:37.2090541Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:37.2091393Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from build) (25.0) 2025-05-07T19:45:37.2092157Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:37.2092627Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:37.2093384Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:37.2093862Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:37.2094313Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:37.2094600Z 2025-05-07T19:45:37.2094801Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:37.2095119Z 2025-05-07T19:45:39.0892786Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:39.0893270Z 2025-05-07T19:45:39.1476483Z [CHECK] Binary make found in PATH 2025-05-07T19:45:40.9341760Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:40.9342075Z 2025-05-07T19:45:40.9922879Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:42.8017730Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:42.8018071Z 2025-05-07T19:45:42.8774193Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:44.7739433Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:46.8093748Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:48.7419933Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:50.7588606Z [CHECK] Python (sub-)package 'skbuild' found ... 
2025-05-07T19:45:52.6750965Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:52.6752211Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:52.6822992Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 11.8.0 2025-05-07T19:45:52.6823509Z . $PRELUDE; install_cuda $BUILD_ENV 11.8.0 2025-05-07T19:45:52.6824138Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:52.6824477Z env: 2025-05-07T19:45:52.6824697Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:52.6825014Z BUILD_ENV: build_binary 2025-05-07T19:45:52.6825257Z BUILD_TARGET: default 2025-05-07T19:45:52.6825507Z BUILD_VARIANT: cuda 2025-05-07T19:45:52.6825736Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:45:52.6825996Z ##[endgroup] 2025-05-07T19:45:53.1434598Z ################################################################################ 2025-05-07T19:45:53.1434973Z # Install CUDA 2025-05-07T19:45:53.1435215Z # 2025-05-07T19:45:53.1452155Z # [2025-05-07T19:45:53.144Z] + install_cuda build_binary 11.8.0 2025-05-07T19:45:53.1452610Z ################################################################################ 2025-05-07T19:45:53.1452937Z 2025-05-07T19:45:53.1473311Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:53.2325772Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:53.2326876Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:53.2328273Z + conda clean --packages --tarball -y 2025-05-07T19:45:53.2328896Z 2025-05-07T19:45:53.7751357Z Will remove 147 (628.1 MB) tarball(s). 2025-05-07T19:45:53.7752322Z Will remove 21 (102.9 MB) package(s). 2025-05-07T19:45:53.8318929Z 2025-05-07T19:45:53.8323325Z + conda clean --all -y 2025-05-07T19:45:53.8324056Z 2025-05-07T19:45:54.4437294Z There are no unused tarball(s) to remove. 2025-05-07T19:45:54.4438313Z Will remove 1 index cache(s). 2025-05-07T19:45:54.4439186Z There are no unused package(s) to remove. 
2025-05-07T19:45:54.4440112Z There are no tempfile(s) to remove. 2025-05-07T19:45:54.4440988Z There are no logfile(s) to remove. 2025-05-07T19:45:54.5008240Z 2025-05-07T19:45:54.5017297Z [INSTALL] Installing CUDA 11.8.0 ... 2025-05-07T19:45:54.5044652Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c nvidia/label/cuda-11.8.0 -y cuda 2025-05-07T19:45:55.5195282Z Channels: 2025-05-07T19:45:55.5195650Z - nvidia/label/cuda-11.8.0 2025-05-07T19:45:55.5195957Z - defaults 2025-05-07T19:45:55.5196195Z Platform: linux-64 2025-05-07T19:45:56.6980638Z Collecting package metadata (repodata.json): - \ | / - \ | done 2025-05-07T19:45:56.9141396Z Solving environment: - \ done 2025-05-07T19:45:57.0372491Z 2025-05-07T19:45:57.0373052Z ## Package Plan ## 2025-05-07T19:45:57.0373228Z 2025-05-07T19:45:57.0373447Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:57.0373800Z 2025-05-07T19:45:57.0373906Z added / updated specs: 2025-05-07T19:45:57.0374202Z - cuda 2025-05-07T19:45:57.0374329Z 2025-05-07T19:45:57.0374333Z 2025-05-07T19:45:57.0374461Z The following packages will be downloaded: 2025-05-07T19:45:57.0374710Z 2025-05-07T19:45:57.0374878Z package | build 2025-05-07T19:45:57.0375321Z ---------------------------|----------------- 2025-05-07T19:45:57.0375726Z cuda-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0376215Z cuda-cccl-11.8.89 | 0 1.2 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0376768Z cuda-command-line-tools-11.8.0| 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0377348Z cuda-compiler-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0377965Z cuda-cudart-11.8.89 | 0 197 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0378483Z cuda-cudart-dev-11.8.89 | 0 1.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0379074Z cuda-cuobjdump-11.8.86 | 0 229 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0380089Z cuda-cupti-11.8.87 | 0 25.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0380602Z 
cuda-cuxxfilt-11.8.86 | 0 291 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0381132Z cuda-demo-suite-11.8.86 | 0 5.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0381665Z cuda-documentation-11.8.86 | 0 89 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0382214Z cuda-driver-dev-11.8.89 | 0 16 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0382697Z cuda-gdb-11.8.86 | 0 4.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0383295Z cuda-libraries-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0383802Z cuda-libraries-dev-11.8.0 | 0 2 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0384292Z cuda-memcheck-11.8.86 | 0 168 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0384786Z cuda-nsight-11.8.86 | 0 113.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0385280Z cuda-nsight-compute-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0385999Z cuda-nvcc-11.8.89 | 0 50.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0386647Z cuda-nvdisasm-11.8.86 | 0 48.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0387175Z cuda-nvml-dev-11.8.86 | 0 83 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0387695Z cuda-nvprof-11.8.87 | 0 4.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0388193Z cuda-nvprune-11.8.86 | 0 65 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0388709Z cuda-nvrtc-11.8.89 | 0 19.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0389213Z cuda-nvrtc-dev-11.8.89 | 0 17.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0389729Z cuda-nvtx-11.8.86 | 0 57 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0390219Z cuda-nvvp-11.8.87 | 0 114.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0390730Z cuda-profiler-api-11.8.86 | 0 18 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0391272Z cuda-runtime-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0391796Z cuda-sanitizer-api-11.8.86 | 0 16.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0392334Z cuda-toolkit-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0392941Z 
cuda-tools-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0393449Z cuda-visual-tools-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0393984Z gds-tools-1.4.0.31 | 0 2 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0394464Z libcublas-11.11.3.6 | 0 364.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0394979Z libcublas-dev-11.11.3.6 | 0 394.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0395493Z libcufft-10.9.0.58 | 0 142.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0395987Z libcufft-dev-10.9.0.58 | 0 275.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0396497Z libcufile-1.4.0.31 | 0 548 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0396995Z libcufile-dev-1.4.0.31 | 0 1.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0397513Z libcurand-10.3.0.86 | 0 53.2 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0398184Z libcurand-dev-10.3.0.86 | 0 53.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0398808Z libcusolver-11.4.1.48 | 0 96.5 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0399446Z libcusolver-dev-11.4.1.48 | 0 66.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0399929Z libcusparse-11.7.5.86 | 0 176.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0400434Z libcusparse-dev-11.7.5.86 | 0 359.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0400901Z libnpp-11.8.0.86 | 0 147.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0401365Z libnpp-dev-11.8.0.86 | 0 144.5 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0401838Z libnvjpeg-11.9.0.86 | 0 2.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0402306Z libnvjpeg-dev-11.9.0.86 | 0 2.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0402828Z nsight-compute-2022.3.0.22 | 0 610.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:57.0403281Z ------------------------------------------------------------ 2025-05-07T19:45:57.0403641Z Total: 3.24 GB 2025-05-07T19:45:57.0403852Z 2025-05-07T19:45:57.0404000Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:57.0404225Z 
2025-05-07T19:45:57.0404407Z cuda nvidia/label/cuda-11.8.0/linux-64::cuda-11.8.0-0 2025-05-07T19:45:57.0404887Z cuda-cccl nvidia/label/cuda-11.8.0/linux-64::cuda-cccl-11.8.89-0 2025-05-07T19:45:57.0405463Z cuda-command-line~ nvidia/label/cuda-11.8.0/linux-64::cuda-command-line-tools-11.8.0-0 2025-05-07T19:45:57.0406094Z cuda-compiler nvidia/label/cuda-11.8.0/linux-64::cuda-compiler-11.8.0-0 2025-05-07T19:45:57.0406654Z cuda-cudart nvidia/label/cuda-11.8.0/linux-64::cuda-cudart-11.8.89-0 2025-05-07T19:45:57.0407213Z cuda-cudart-dev nvidia/label/cuda-11.8.0/linux-64::cuda-cudart-dev-11.8.89-0 2025-05-07T19:45:57.0407815Z cuda-cuobjdump nvidia/label/cuda-11.8.0/linux-64::cuda-cuobjdump-11.8.86-0 2025-05-07T19:45:57.0408360Z cuda-cupti nvidia/label/cuda-11.8.0/linux-64::cuda-cupti-11.8.87-0 2025-05-07T19:45:57.0408919Z cuda-cuxxfilt nvidia/label/cuda-11.8.0/linux-64::cuda-cuxxfilt-11.8.86-0 2025-05-07T19:45:57.0409510Z cuda-demo-suite nvidia/label/cuda-11.8.0/linux-64::cuda-demo-suite-11.8.86-0 2025-05-07T19:45:57.0410117Z cuda-documentation nvidia/label/cuda-11.8.0/linux-64::cuda-documentation-11.8.86-0 2025-05-07T19:45:57.0410747Z cuda-driver-dev nvidia/label/cuda-11.8.0/linux-64::cuda-driver-dev-11.8.89-0 2025-05-07T19:45:57.0411274Z cuda-gdb nvidia/label/cuda-11.8.0/linux-64::cuda-gdb-11.8.86-0 2025-05-07T19:45:57.0411821Z cuda-libraries nvidia/label/cuda-11.8.0/linux-64::cuda-libraries-11.8.0-0 2025-05-07T19:45:57.0412440Z cuda-libraries-dev nvidia/label/cuda-11.8.0/linux-64::cuda-libraries-dev-11.8.0-0 2025-05-07T19:45:57.0413033Z cuda-memcheck nvidia/label/cuda-11.8.0/linux-64::cuda-memcheck-11.8.86-0 2025-05-07T19:45:57.0413599Z cuda-nsight nvidia/label/cuda-11.8.0/linux-64::cuda-nsight-11.8.86-0 2025-05-07T19:45:57.0414177Z cuda-nsight-compu~ nvidia/label/cuda-11.8.0/linux-64::cuda-nsight-compute-11.8.0-0 2025-05-07T19:45:57.0414753Z cuda-nvcc nvidia/label/cuda-11.8.0/linux-64::cuda-nvcc-11.8.89-0 2025-05-07T19:45:57.0415293Z cuda-nvdisasm 
nvidia/label/cuda-11.8.0/linux-64::cuda-nvdisasm-11.8.86-0 2025-05-07T19:45:57.0415845Z cuda-nvml-dev nvidia/label/cuda-11.8.0/linux-64::cuda-nvml-dev-11.8.86-0 2025-05-07T19:45:57.0416398Z cuda-nvprof nvidia/label/cuda-11.8.0/linux-64::cuda-nvprof-11.8.87-0 2025-05-07T19:45:57.0416930Z cuda-nvprune nvidia/label/cuda-11.8.0/linux-64::cuda-nvprune-11.8.86-0 2025-05-07T19:45:57.0417466Z cuda-nvrtc nvidia/label/cuda-11.8.0/linux-64::cuda-nvrtc-11.8.89-0 2025-05-07T19:45:57.0418191Z cuda-nvrtc-dev nvidia/label/cuda-11.8.0/linux-64::cuda-nvrtc-dev-11.8.89-0 2025-05-07T19:45:57.0418718Z cuda-nvtx nvidia/label/cuda-11.8.0/linux-64::cuda-nvtx-11.8.86-0 2025-05-07T19:45:57.0419219Z cuda-nvvp nvidia/label/cuda-11.8.0/linux-64::cuda-nvvp-11.8.87-0 2025-05-07T19:45:57.0419768Z cuda-profiler-api nvidia/label/cuda-11.8.0/linux-64::cuda-profiler-api-11.8.86-0 2025-05-07T19:45:57.0420356Z cuda-runtime nvidia/label/cuda-11.8.0/linux-64::cuda-runtime-11.8.0-0 2025-05-07T19:45:57.0420950Z cuda-sanitizer-api nvidia/label/cuda-11.8.0/linux-64::cuda-sanitizer-api-11.8.86-0 2025-05-07T19:45:57.0421532Z cuda-toolkit nvidia/label/cuda-11.8.0/linux-64::cuda-toolkit-11.8.0-0 2025-05-07T19:45:57.0422066Z cuda-tools nvidia/label/cuda-11.8.0/linux-64::cuda-tools-11.8.0-0 2025-05-07T19:45:57.0422616Z cuda-visual-tools nvidia/label/cuda-11.8.0/linux-64::cuda-visual-tools-11.8.0-0 2025-05-07T19:45:57.0423192Z gds-tools nvidia/label/cuda-11.8.0/linux-64::gds-tools-1.4.0.31-0 2025-05-07T19:45:57.0423712Z libcublas nvidia/label/cuda-11.8.0/linux-64::libcublas-11.11.3.6-0 2025-05-07T19:45:57.0424253Z libcublas-dev nvidia/label/cuda-11.8.0/linux-64::libcublas-dev-11.11.3.6-0 2025-05-07T19:45:57.0424807Z libcufft nvidia/label/cuda-11.8.0/linux-64::libcufft-10.9.0.58-0 2025-05-07T19:45:57.0425332Z libcufft-dev nvidia/label/cuda-11.8.0/linux-64::libcufft-dev-10.9.0.58-0 2025-05-07T19:45:57.0425881Z libcufile nvidia/label/cuda-11.8.0/linux-64::libcufile-1.4.0.31-0 2025-05-07T19:45:57.0426436Z libcufile-dev 
nvidia/label/cuda-11.8.0/linux-64::libcufile-dev-1.4.0.31-0 2025-05-07T19:45:57.0426967Z libcurand nvidia/label/cuda-11.8.0/linux-64::libcurand-10.3.0.86-0 2025-05-07T19:45:57.0427520Z libcurand-dev nvidia/label/cuda-11.8.0/linux-64::libcurand-dev-10.3.0.86-0 2025-05-07T19:45:57.0428078Z libcusolver nvidia/label/cuda-11.8.0/linux-64::libcusolver-11.4.1.48-0 2025-05-07T19:45:57.0428672Z libcusolver-dev nvidia/label/cuda-11.8.0/linux-64::libcusolver-dev-11.4.1.48-0 2025-05-07T19:45:57.0429265Z libcusparse nvidia/label/cuda-11.8.0/linux-64::libcusparse-11.7.5.86-0 2025-05-07T19:45:57.0429839Z libcusparse-dev nvidia/label/cuda-11.8.0/linux-64::libcusparse-dev-11.7.5.86-0 2025-05-07T19:45:57.0430391Z libnpp nvidia/label/cuda-11.8.0/linux-64::libnpp-11.8.0.86-0 2025-05-07T19:45:57.0430887Z libnpp-dev nvidia/label/cuda-11.8.0/linux-64::libnpp-dev-11.8.0.86-0 2025-05-07T19:45:57.0431424Z libnvjpeg nvidia/label/cuda-11.8.0/linux-64::libnvjpeg-11.9.0.86-0 2025-05-07T19:45:57.0431984Z libnvjpeg-dev nvidia/label/cuda-11.8.0/linux-64::libnvjpeg-dev-11.9.0.86-0 2025-05-07T19:45:57.0432665Z nsight-compute nvidia/label/cuda-11.8.0/linux-64::nsight-compute-2022.3.0.22-0 2025-05-07T19:45:57.0433236Z 2025-05-07T19:45:57.0474391Z 2025-05-07T19:45:57.0474523Z 2025-05-07T19:45:57.0475194Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:57.0476853Z nsight-compute-2022. | 610.0 MB | | 0% 2025-05-07T19:45:57.0477619Z 2025-05-07T19:45:57.0486459Z libcublas-dev-11.11. | 394.1 MB | | 0%  2025-05-07T19:45:57.0487282Z 2025-05-07T19:45:57.0487294Z 2025-05-07T19:45:57.0505786Z libcublas-11.11.3.6 | 364.0 MB | | 0%  2025-05-07T19:45:57.0506665Z 2025-05-07T19:45:57.0506680Z 2025-05-07T19:45:57.0506716Z 2025-05-07T19:45:57.0520161Z libcusparse-dev-11.7 | 359.7 MB | | 0%  2025-05-07T19:45:57.0521028Z 2025-05-07T19:45:57.0521039Z 2025-05-07T19:45:57.0521050Z 2025-05-07T19:45:57.0521061Z 2025-05-07T19:45:57.0524205Z libcufft-dev-10.9.0. 
| 275.8 MB | | 0%  2025-05-07T19:45:57.0524519Z 2025-05-07T19:45:57.0524523Z 2025-05-07T19:45:57.0524527Z 2025-05-07T19:45:57.0524531Z 2025-05-07T19:45:57.0524534Z 2025-05-07T19:45:57.0525830Z libcusparse-11.7.5.8 | 176.3 MB | | 0%  2025-05-07T19:45:57.0526258Z 2025-05-07T19:45:57.0526262Z 2025-05-07T19:45:57.0526266Z 2025-05-07T19:45:57.0526270Z 2025-05-07T19:45:57.0526273Z 2025-05-07T19:45:57.0526277Z 2025-05-07T19:45:57.0526827Z libnpp-11.8.0.86 | 147.8 MB | | 0%  2025-05-07T19:45:57.0527135Z 2025-05-07T19:45:57.0527151Z 2025-05-07T19:45:57.0527154Z 2025-05-07T19:45:57.0527158Z 2025-05-07T19:45:57.0527162Z 2025-05-07T19:45:57.0527166Z 2025-05-07T19:45:57.0527169Z 2025-05-07T19:45:57.0528303Z libnpp-dev-11.8.0.86 | 144.5 MB | | 0%  2025-05-07T19:45:57.0528632Z 2025-05-07T19:45:57.0528636Z 2025-05-07T19:45:57.0528639Z 2025-05-07T19:45:57.0528642Z 2025-05-07T19:45:57.0528646Z 2025-05-07T19:45:57.0528649Z 2025-05-07T19:45:57.0528653Z 2025-05-07T19:45:57.0528656Z 2025-05-07T19:45:57.0529678Z libcufft-10.9.0.58 | 142.8 MB | | 0%  2025-05-07T19:45:57.0530014Z 2025-05-07T19:45:57.0530018Z 2025-05-07T19:45:57.0530026Z 2025-05-07T19:45:57.0530030Z 2025-05-07T19:45:57.0530034Z 2025-05-07T19:45:57.0530038Z 2025-05-07T19:45:57.0530041Z 2025-05-07T19:45:57.0530045Z 2025-05-07T19:45:57.0530048Z 2025-05-07T19:45:57.0538807Z cuda-nvvp-11.8.87 | 114.4 MB | | 0%  2025-05-07T19:45:57.0539141Z 2025-05-07T19:45:57.0539145Z 2025-05-07T19:45:57.0539149Z 2025-05-07T19:45:57.0539152Z 2025-05-07T19:45:57.0539155Z 2025-05-07T19:45:57.0539159Z 2025-05-07T19:45:57.0539162Z 2025-05-07T19:45:57.0539165Z 2025-05-07T19:45:57.0539169Z 2025-05-07T19:45:57.0539173Z 2025-05-07T19:45:57.0540267Z cuda-nsight-11.8.86 | 113.6 MB | | 0%  2025-05-07T19:45:57.0540597Z 2025-05-07T19:45:57.0540601Z 2025-05-07T19:45:57.0540604Z 2025-05-07T19:45:57.0540608Z 2025-05-07T19:45:57.0540612Z 2025-05-07T19:45:57.0540615Z 2025-05-07T19:45:57.0540618Z 2025-05-07T19:45:57.0540622Z 2025-05-07T19:45:57.0540632Z 
2025-05-07T19:45:57.0540635Z 2025-05-07T19:45:57.0540643Z 2025-05-07T19:45:57.0541702Z libcusolver-11.4.1.4 | 96.5 MB | | 0%  2025-05-07T19:45:57.0542035Z 2025-05-07T19:45:57.0542039Z 2025-05-07T19:45:57.0542043Z 2025-05-07T19:45:57.0542046Z 2025-05-07T19:45:57.0542051Z 2025-05-07T19:45:57.0542054Z 2025-05-07T19:45:57.0542058Z 2025-05-07T19:45:57.0542061Z 2025-05-07T19:45:57.0542065Z 2025-05-07T19:45:57.0542068Z 2025-05-07T19:45:57.0542091Z 2025-05-07T19:45:57.0542094Z 2025-05-07T19:45:57.0543058Z libcusolver-dev-11.4 | 66.3 MB | | 0%  2025-05-07T19:45:57.0543401Z 2025-05-07T19:45:57.0543405Z 2025-05-07T19:45:57.0543408Z 2025-05-07T19:45:57.0543412Z 2025-05-07T19:45:57.0543415Z 2025-05-07T19:45:57.0543419Z 2025-05-07T19:45:57.0543441Z 2025-05-07T19:45:57.0543444Z 2025-05-07T19:45:57.0543447Z 2025-05-07T19:45:57.0543451Z 2025-05-07T19:45:57.0543454Z 2025-05-07T19:45:57.0543463Z 2025-05-07T19:45:57.0543466Z 2025-05-07T19:45:57.0545094Z libcurand-dev-10.3.0 | 53.7 MB | | 0%  2025-05-07T19:45:57.0545499Z 2025-05-07T19:45:57.0545503Z 2025-05-07T19:45:57.0545507Z 2025-05-07T19:45:57.0545539Z 2025-05-07T19:45:57.0545543Z 2025-05-07T19:45:57.0545547Z 2025-05-07T19:45:57.0545550Z 2025-05-07T19:45:57.0545554Z 2025-05-07T19:45:57.0545557Z 2025-05-07T19:45:57.0545561Z 2025-05-07T19:45:57.0545565Z 2025-05-07T19:45:57.0545569Z 2025-05-07T19:45:57.0545572Z 2025-05-07T19:45:57.0545576Z 2025-05-07T19:45:57.0545910Z libcurand-10.3.0.86 | 53.2 MB | | 0%  2025-05-07T19:45:57.0546238Z 2025-05-07T19:45:57.0546242Z 2025-05-07T19:45:57.0546246Z 2025-05-07T19:45:57.0546250Z 2025-05-07T19:45:57.0546254Z 2025-05-07T19:45:57.0546258Z 2025-05-07T19:45:57.0546262Z 2025-05-07T19:45:57.0546265Z 2025-05-07T19:45:57.0546269Z 2025-05-07T19:45:57.0546272Z 2025-05-07T19:45:57.0546503Z 2025-05-07T19:45:57.0546507Z 2025-05-07T19:45:57.0546696Z 2025-05-07T19:45:57.0546700Z 2025-05-07T19:45:57.0546735Z 2025-05-07T19:45:57.0548885Z cuda-nvcc-11.8.89 | 50.8 MB | | 0%  2025-05-07T19:45:57.0549210Z 
2025-05-07T19:45:57.0549213Z 2025-05-07T19:45:57.0549217Z 2025-05-07T19:45:57.0549234Z 2025-05-07T19:45:57.0549238Z 2025-05-07T19:45:57.0549242Z 2025-05-07T19:45:57.0549263Z 2025-05-07T19:45:57.0549266Z 2025-05-07T19:45:57.0549270Z 2025-05-07T19:45:57.0549273Z 2025-05-07T19:45:57.0549276Z 2025-05-07T19:45:57.0549280Z 2025-05-07T19:45:57.0549283Z 2025-05-07T19:45:57.0549286Z 2025-05-07T19:45:57.0549290Z 2025-05-07T19:45:57.0549293Z 2025-05-07T19:45:57.0554550Z cuda-nvdisasm-11.8.8 | 48.7 MB | | 0%  2025-05-07T19:45:57.0554927Z 2025-05-07T19:45:57.0554931Z 2025-05-07T19:45:57.0554934Z 2025-05-07T19:45:57.0554938Z 2025-05-07T19:45:57.0554941Z 2025-05-07T19:45:57.0554952Z 2025-05-07T19:45:57.0554961Z 2025-05-07T19:45:57.0554965Z 2025-05-07T19:45:57.0554968Z 2025-05-07T19:45:57.0554972Z 2025-05-07T19:45:57.0554975Z 2025-05-07T19:45:57.0554979Z 2025-05-07T19:45:57.0554982Z 2025-05-07T19:45:57.0554986Z 2025-05-07T19:45:57.0554990Z 2025-05-07T19:45:57.0554994Z 2025-05-07T19:45:57.0554997Z 2025-05-07T19:45:57.0556498Z cuda-cupti-11.8.87 | 25.3 MB | | 0%  2025-05-07T19:45:57.0556868Z 2025-05-07T19:45:57.0556873Z 2025-05-07T19:45:57.0556878Z 2025-05-07T19:45:57.0556883Z 2025-05-07T19:45:57.0556887Z 2025-05-07T19:45:57.0556910Z 2025-05-07T19:45:57.0556914Z 2025-05-07T19:45:57.0556918Z 2025-05-07T19:45:57.0556922Z 2025-05-07T19:45:57.0556927Z 2025-05-07T19:45:57.0556931Z 2025-05-07T19:45:57.0556935Z 2025-05-07T19:45:57.0556939Z 2025-05-07T19:45:57.0556957Z 2025-05-07T19:45:57.0556961Z 2025-05-07T19:45:57.0556966Z 2025-05-07T19:45:57.0556969Z 2025-05-07T19:45:57.0556972Z 2025-05-07T19:45:57.0566440Z cuda-nvrtc-11.8.89 | 19.1 MB | | 0%  2025-05-07T19:45:57.0566836Z 2025-05-07T19:45:57.0566841Z 2025-05-07T19:45:57.0566859Z 2025-05-07T19:45:57.0566863Z 2025-05-07T19:45:57.0566868Z 2025-05-07T19:45:57.0566872Z 2025-05-07T19:45:57.0566876Z 2025-05-07T19:45:57.0566880Z 2025-05-07T19:45:57.0566885Z 2025-05-07T19:45:57.0566913Z 2025-05-07T19:45:57.0566918Z 2025-05-07T19:45:57.0566922Z 
2025-05-07T19:45:57.0566926Z 2025-05-07T19:45:57.0566931Z 2025-05-07T19:45:57.0566935Z 2025-05-07T19:45:57.0566939Z 2025-05-07T19:45:57.0566944Z 2025-05-07T19:45:57.0566948Z 2025-05-07T19:45:57.0566953Z 2025-05-07T19:46:00.5075017Z ... (more hidden) ... 2025-05-07T19:46:00.5075552Z 2025-05-07T19:46:00.5075561Z 2025-05-07T19:46:00.5075568Z 2025-05-07T19:46:00.5075575Z 2025-05-07T19:46:00.5076269Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:46:00.5076745Z 2025-05-07T19:46:00.5076764Z 2025-05-07T19:46:00.5076768Z 2025-05-07T19:46:00.5076771Z 2025-05-07T19:46:02.0773786Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:46:02.0774725Z 2025-05-07T19:46:02.0774760Z 2025-05-07T19:46:02.0775466Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:46:02.0776292Z 2025-05-07T19:46:02.0776303Z 2025-05-07T19:46:03.9041203Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:46:03.9041528Z 2025-05-07T19:46:03.9041536Z 2025-05-07T19:46:03.9041544Z 2025-05-07T19:46:03.9041554Z 2025-05-07T19:46:03.9041602Z 2025-05-07T19:46:03.9042019Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:46:03.9042458Z 2025-05-07T19:46:03.9042465Z 2025-05-07T19:46:03.9042471Z 2025-05-07T19:46:03.9042477Z 2025-05-07T19:46:03.9042484Z 2025-05-07T19:46:04.0956369Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:46:04.0957025Z 2025-05-07T19:46:04.0957160Z 2025-05-07T19:46:04.0957165Z 2025-05-07T19:46:04.0957169Z 2025-05-07T19:46:04.0957172Z 2025-05-07T19:46:04.0957176Z 2025-05-07T19:46:04.0957450Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:46:04.0957752Z 2025-05-07T19:46:04.0957756Z 2025-05-07T19:46:04.0957759Z 2025-05-07T19:46:04.0957763Z 2025-05-07T19:46:04.0957766Z 2025-05-07T19:46:04.0957770Z 2025-05-07T19:46:05.2120028Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:46:05.2120389Z 2025-05-07T19:46:05.2120687Z libcublas-dev-11.11. 
| 394.1 MB | ########## | 100%  2025-05-07T19:46:05.2120954Z 2025-05-07T19:46:05.2746626Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%  2025-05-07T19:46:05.2746938Z 2025-05-07T19:46:05.2746943Z 2025-05-07T19:46:05.2746948Z 2025-05-07T19:46:05.2747201Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:46:05.2747602Z 2025-05-07T19:46:05.2747644Z 2025-05-07T19:46:05.2747650Z 2025-05-07T19:46:06.4675191Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:46:06.4675556Z 2025-05-07T19:46:06.4675562Z 2025-05-07T19:46:06.4675566Z 2025-05-07T19:46:06.4675570Z 2025-05-07T19:46:06.4675574Z 2025-05-07T19:46:06.4675581Z 2025-05-07T19:46:06.4675585Z 2025-05-07T19:46:06.4675866Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:46:06.4676170Z 2025-05-07T19:46:06.4676174Z 2025-05-07T19:46:06.4676177Z 2025-05-07T19:46:06.4676181Z 2025-05-07T19:46:06.4676197Z 2025-05-07T19:46:06.4676200Z 2025-05-07T19:46:06.4676204Z 2025-05-07T19:46:06.4931791Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:46:06.4932140Z 2025-05-07T19:46:06.4932145Z 2025-05-07T19:46:06.4932150Z 2025-05-07T19:46:06.4932154Z 2025-05-07T19:46:06.4932171Z 2025-05-07T19:46:06.4932176Z 2025-05-07T19:46:06.4932181Z 2025-05-07T19:46:06.4932187Z 2025-05-07T19:46:06.4932471Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:46:06.4932780Z 2025-05-07T19:46:06.4932784Z 2025-05-07T19:46:06.4932787Z 2025-05-07T19:46:06.4932791Z 2025-05-07T19:46:06.4932794Z 2025-05-07T19:46:06.4932797Z 2025-05-07T19:46:06.4932813Z 2025-05-07T19:46:06.4932823Z 2025-05-07T19:46:07.1405114Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:46:07.1405460Z 2025-05-07T19:46:07.1405465Z 2025-05-07T19:46:07.1405469Z 2025-05-07T19:46:07.1405473Z 2025-05-07T19:46:07.1405477Z 2025-05-07T19:46:07.1405482Z 2025-05-07T19:46:07.1405488Z 2025-05-07T19:46:07.1405494Z 2025-05-07T19:46:07.1405500Z 2025-05-07T19:46:07.1405530Z 2025-05-07T19:46:07.1405940Z 
cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:46:07.1406306Z 2025-05-07T19:46:07.1406313Z 2025-05-07T19:46:07.1406320Z 2025-05-07T19:46:07.1406326Z 2025-05-07T19:46:07.1406337Z 2025-05-07T19:46:07.1406343Z 2025-05-07T19:46:07.1406383Z 2025-05-07T19:46:07.1406402Z 2025-05-07T19:46:07.1406408Z 2025-05-07T19:46:07.1406415Z 2025-05-07T19:46:07.7975448Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:46:07.7975798Z 2025-05-07T19:46:07.7975804Z 2025-05-07T19:46:07.7975812Z 2025-05-07T19:46:07.7975816Z 2025-05-07T19:46:07.7975835Z 2025-05-07T19:46:07.7975839Z 2025-05-07T19:46:07.7975844Z 2025-05-07T19:46:07.7975848Z 2025-05-07T19:46:07.7975851Z 2025-05-07T19:46:07.7976126Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:46:07.7976438Z 2025-05-07T19:46:07.7976443Z 2025-05-07T19:46:07.7976447Z 2025-05-07T19:46:07.7976451Z 2025-05-07T19:46:07.7976455Z 2025-05-07T19:46:07.7976459Z 2025-05-07T19:46:07.7976463Z 2025-05-07T19:46:07.7976468Z 2025-05-07T19:46:07.7976476Z 2025-05-07T19:46:07.8233409Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:46:07.8234016Z 2025-05-07T19:46:07.8234336Z 2025-05-07T19:46:07.8234481Z 2025-05-07T19:46:07.8234486Z 2025-05-07T19:46:07.8234490Z 2025-05-07T19:46:07.8234493Z 2025-05-07T19:46:07.8234497Z 2025-05-07T19:46:07.8234500Z 2025-05-07T19:46:07.8234504Z 2025-05-07T19:46:07.8234508Z 2025-05-07T19:46:07.8234511Z 2025-05-07T19:46:07.8234515Z 2025-05-07T19:46:07.8235067Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:46:07.8235431Z 2025-05-07T19:46:07.8235435Z 2025-05-07T19:46:07.8235439Z 2025-05-07T19:46:07.8235442Z 2025-05-07T19:46:07.8235445Z 2025-05-07T19:46:07.8235449Z 2025-05-07T19:46:07.8235452Z 2025-05-07T19:46:07.8235456Z 2025-05-07T19:46:07.8235459Z 2025-05-07T19:46:07.8235462Z 2025-05-07T19:46:07.8235466Z 2025-05-07T19:46:07.8235469Z 2025-05-07T19:46:07.9893330Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:46:07.9893721Z 
2025-05-07T19:46:07.9893726Z 2025-05-07T19:46:07.9893731Z 2025-05-07T19:46:07.9893751Z 2025-05-07T19:46:07.9893755Z 2025-05-07T19:46:07.9893773Z 2025-05-07T19:46:07.9893776Z 2025-05-07T19:46:07.9893780Z 2025-05-07T19:46:07.9893783Z 2025-05-07T19:46:07.9893787Z 2025-05-07T19:46:07.9893790Z 2025-05-07T19:46:07.9894096Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:46:07.9894412Z 2025-05-07T19:46:07.9894416Z 2025-05-07T19:46:07.9894420Z 2025-05-07T19:46:07.9894423Z 2025-05-07T19:46:07.9894426Z 2025-05-07T19:46:07.9894430Z 2025-05-07T19:46:07.9894433Z 2025-05-07T19:46:07.9894436Z 2025-05-07T19:46:07.9894440Z 2025-05-07T19:46:07.9894443Z 2025-05-07T19:46:07.9894446Z 2025-05-07T19:46:08.0074260Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:46:08.0074621Z 2025-05-07T19:46:08.0074626Z 2025-05-07T19:46:08.0074629Z 2025-05-07T19:46:08.0074633Z 2025-05-07T19:46:08.0074636Z 2025-05-07T19:46:08.0074640Z 2025-05-07T19:46:08.0074643Z 2025-05-07T19:46:08.0074659Z 2025-05-07T19:46:08.0074663Z 2025-05-07T19:46:08.0074675Z 2025-05-07T19:46:08.0074693Z 2025-05-07T19:46:08.0074696Z 2025-05-07T19:46:08.0074700Z 2025-05-07T19:46:08.0075009Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:46:08.0075339Z 2025-05-07T19:46:08.0075343Z 2025-05-07T19:46:08.0075347Z 2025-05-07T19:46:08.0075351Z 2025-05-07T19:46:08.0075354Z 2025-05-07T19:46:08.0075358Z 2025-05-07T19:46:08.0075361Z 2025-05-07T19:46:08.0075378Z 2025-05-07T19:46:08.0075381Z 2025-05-07T19:46:08.0075385Z 2025-05-07T19:46:08.0075388Z 2025-05-07T19:46:08.0075392Z 2025-05-07T19:46:08.0075395Z 2025-05-07T19:46:08.1715098Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:46:08.1715606Z nsight-compute-2022. | 610.0 MB | ########## | 100% 2025-05-07T19:46:08.5033156Z nsight-compute-2022. 
| 610.0 MB | ########## | 100% 2025-05-07T19:46:08.5033513Z 2025-05-07T19:46:08.5033532Z 2025-05-07T19:46:08.5033784Z 2025-05-07T19:46:08.5033822Z 2025-05-07T19:46:08.5033829Z 2025-05-07T19:46:08.5033836Z 2025-05-07T19:46:08.5033841Z 2025-05-07T19:46:08.5033846Z 2025-05-07T19:46:08.5033850Z 2025-05-07T19:46:08.5033856Z 2025-05-07T19:46:08.5033862Z 2025-05-07T19:46:08.5033902Z 2025-05-07T19:46:08.5033907Z 2025-05-07T19:46:08.5033911Z 2025-05-07T19:46:08.5033915Z 2025-05-07T19:46:08.5033920Z 2025-05-07T19:46:08.5033945Z 2025-05-07T19:46:08.5034617Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100%  2025-05-07T19:46:08.5034976Z 2025-05-07T19:46:08.5034980Z 2025-05-07T19:46:08.5034995Z 2025-05-07T19:46:08.5034999Z 2025-05-07T19:46:08.5035002Z 2025-05-07T19:46:08.5035005Z 2025-05-07T19:46:08.5035009Z 2025-05-07T19:46:08.5035012Z 2025-05-07T19:46:08.5035016Z 2025-05-07T19:46:08.5035019Z 2025-05-07T19:46:08.5035036Z 2025-05-07T19:46:08.5035039Z 2025-05-07T19:46:08.5035043Z 2025-05-07T19:46:08.5035047Z 2025-05-07T19:46:08.5035051Z 2025-05-07T19:46:08.5035292Z 2025-05-07T19:46:08.5035406Z 2025-05-07T19:46:08.7803515Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100%  2025-05-07T19:46:08.7803886Z 2025-05-07T19:46:08.7803892Z 2025-05-07T19:46:08.7803897Z 2025-05-07T19:46:08.7803901Z 2025-05-07T19:46:08.7803905Z 2025-05-07T19:46:08.7803909Z 2025-05-07T19:46:08.7803913Z 2025-05-07T19:46:08.7803917Z 2025-05-07T19:46:08.7803921Z 2025-05-07T19:46:08.7803925Z 2025-05-07T19:46:08.7803930Z 2025-05-07T19:46:08.7803946Z 2025-05-07T19:46:08.7803950Z 2025-05-07T19:46:08.7803971Z 2025-05-07T19:46:08.7803975Z 2025-05-07T19:46:08.7803979Z 2025-05-07T19:46:08.7804310Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:46:08.7804653Z 2025-05-07T19:46:08.7804657Z 2025-05-07T19:46:08.7804674Z 2025-05-07T19:46:08.7804681Z 2025-05-07T19:46:08.7804685Z 2025-05-07T19:46:08.7804689Z 2025-05-07T19:46:08.7804693Z 2025-05-07T19:46:08.7804720Z 2025-05-07T19:46:08.7804739Z 
2025-05-07T19:46:08.7804742Z 2025-05-07T19:46:08.7804746Z 2025-05-07T19:46:08.7804749Z 2025-05-07T19:46:08.7804752Z 2025-05-07T19:46:08.7804756Z 2025-05-07T19:46:08.7804759Z 2025-05-07T19:46:08.7804763Z 2025-05-07T19:46:08.8048614Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:46:08.8049007Z 2025-05-07T19:46:08.8049012Z 2025-05-07T19:46:08.8049016Z 2025-05-07T19:46:08.8049020Z 2025-05-07T19:46:08.8049023Z 2025-05-07T19:46:08.8049027Z 2025-05-07T19:46:08.8049030Z 2025-05-07T19:46:08.8049034Z 2025-05-07T19:46:08.8049037Z 2025-05-07T19:46:08.8049041Z 2025-05-07T19:46:08.8049044Z 2025-05-07T19:46:08.8049047Z 2025-05-07T19:46:08.8049051Z 2025-05-07T19:46:08.8049054Z 2025-05-07T19:46:08.8049058Z 2025-05-07T19:46:08.8049074Z 2025-05-07T19:46:08.8049077Z 2025-05-07T19:46:08.8049081Z 2025-05-07T19:46:08.8049407Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100%  2025-05-07T19:46:08.8049761Z 2025-05-07T19:46:08.8049765Z 2025-05-07T19:46:08.8049769Z 2025-05-07T19:46:08.8049772Z 2025-05-07T19:46:08.8049775Z 2025-05-07T19:46:08.8049779Z 2025-05-07T19:46:08.8049782Z 2025-05-07T19:46:08.8049798Z 2025-05-07T19:46:08.8049802Z 2025-05-07T19:46:08.8049805Z 2025-05-07T19:46:08.8049809Z 2025-05-07T19:46:08.8049812Z 2025-05-07T19:46:08.8049815Z 2025-05-07T19:46:08.8049819Z 2025-05-07T19:46:08.8049822Z 2025-05-07T19:46:08.8049826Z 2025-05-07T19:46:08.8049829Z 2025-05-07T19:46:08.8049832Z 2025-05-07T19:46:08.8541529Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100%  2025-05-07T19:46:08.8541922Z 2025-05-07T19:46:08.8541927Z 2025-05-07T19:46:08.8541930Z 2025-05-07T19:46:08.8541934Z 2025-05-07T19:46:08.8541937Z 2025-05-07T19:46:08.8541941Z 2025-05-07T19:46:08.8541946Z 2025-05-07T19:46:08.8541950Z 2025-05-07T19:46:08.8541953Z 2025-05-07T19:46:08.8541957Z 2025-05-07T19:46:08.8541977Z 2025-05-07T19:46:08.8541989Z 2025-05-07T19:46:08.8541992Z 2025-05-07T19:46:08.8541996Z 2025-05-07T19:46:08.8541999Z 2025-05-07T19:46:08.8542328Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100%  
2025-05-07T19:46:08.8918368Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100% 2025-05-07T19:46:08.8919432Z ... (more hidden) ... 2025-05-07T19:46:09.0649503Z ... (more hidden) ...
2025-05-07T19:46:09.0650247Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100% 2025-05-07T19:46:21.6489891Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100% 2025-05-07T19:46:30.8003243Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100% 2025-05-07T19:46:35.7597913Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%
2025-05-07T19:46:38.8023556Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100% 2025-05-07T19:46:52.7626316Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100% 2025-05-07T19:47:07.7318146Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100% 2025-05-07T19:47:10.1027428Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100% 2025-05-07T19:47:12.3705788Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100% 2025-05-07T19:47:15.5438103Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%
2025-05-07T19:47:18.4475461Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100% 2025-05-07T19:47:21.8830020Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100% 2025-05-07T19:47:23.1260802Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100% 2025-05-07T19:47:26.1520977Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%
2025-05-07T19:47:29.2991571Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100% 2025-05-07T19:47:29.5350267Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100% 2025-05-07T19:47:31.6351335Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100% 2025-05-07T19:47:33.8739551Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100%
2025-05-07T19:47:35.3872302Z ... (more hidden) ... 2025-05-07T19:47:39.9809745Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100% 2025-05-07T19:48:08.5017982Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100% 2025-05-07T19:48:08.5024099Z nsight-compute-2022. | 610.0 MB | ########## | 100%
2025-05-07T19:48:08.5048171Z done 2025-05-07T19:48:08.6045185Z Preparing transaction: done 2025-05-07T19:48:08.8057906Z Verifying transaction: done 2025-05-07T19:48:09.0091302Z Executing transaction: done 2025-05-07T19:48:11.0485395Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:48:11.0872280Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib/stubs ... 2025-05-07T19:48:12.9703932Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib:/github/home/miniconda/envs/build_binary/lib/stubs 2025-05-07T19:48:12.9704645Z 2025-05-07T19:48:13.3797216Z 2025-05-07T19:48:13.3801971Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:48:13.4156402Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:48:13.4156971Z 2025-05-07T19:48:13.8362311Z 2025-05-07T19:48:13.8362738Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ...
2025-05-07T19:48:13.8363799Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:48:13.8364619Z 2025-05-07T19:48:14.2384614Z 2025-05-07T19:48:16.2025698Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/include/cuda_runtime.h 2025-05-07T19:48:18.1773784Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:48:20.1454502Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:48:22.0827203Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:48:23.9117870Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:48:23.9118189Z 2025-05-07T19:48:23.9711369Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:48:27.6623954Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:27.6624681Z Target: x86_64-conda-linux-gnu 2025-05-07T19:48:27.6624973Z Thread model: posix 2025-05-07T19:48:27.6625321Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:48:27.6626014Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang.cfg 2025-05-07T19:48:27.6626499Z 2025-05-07T19:48:27.7210755Z [INSTALL] Resetting compiler symlinks to clang ... 
2025-05-07T19:48:31.4921147Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:48:31.4921723Z 2025-05-07T19:48:31.4937763Z 2025-05-07T19:48:31.4955290Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:48:31.4955844Z 2025-05-07T19:48:31.4979626Z 2025-05-07T19:48:31.4999708Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:48:31.5000251Z 2025-05-07T19:48:31.5015732Z 2025-05-07T19:48:31.5034763Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:48:31.5036390Z 2025-05-07T19:48:31.5049412Z 2025-05-07T19:48:31.5050445Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:48:31.5051189Z 2025-05-07T19:48:31.5066782Z total 36 2025-05-07T19:48:31.5068704Z drwxr-xr-x. 2 root root 188 May 7 19:45 . 2025-05-07T19:48:31.5069777Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:48:31.5071041Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:48:31.5072466Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:48:31.5072939Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:48:31.5073497Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:48:31.5073925Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:48:31.5074372Z -rw-r--r--. 
2 root root 499 Nov 30 04:26 openjdk_activate.sh 2025-05-07T19:48:31.5074643Z 2025-05-07T19:48:31.5074830Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:48:31.5075124Z 2025-05-07T19:48:33.4197671Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:33.4200321Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:48:33.4201331Z 2025-05-07T19:48:33.4201478Z [BUILD] Setting Clang as the NVCC host compiler: 2025-05-07T19:48:35.2755196Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:48:35.2757902Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++" 2025-05-07T19:48:35.2760165Z 2025-05-07T19:48:35.6843967Z 2025-05-07T19:48:35.6844794Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:48:35.6845641Z 2025-05-07T19:48:37.4933573Z -allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:48:37.4934441Z 2025-05-07T19:48:37.5680910Z 2025-05-07T19:48:37.5681389Z [INFO] Printing out all preprocessor defines in nvcc ... 
2025-05-07T19:48:37.5681949Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:48:37.5682314Z 2025-05-07T19:48:39.4696528Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:48:39.4696894Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:48:39.4697177Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:48:39.4697464Z #define ADJ_MICRO 0x1000 2025-05-07T19:48:39.4697716Z #define ADJ_NANO 0x2000 2025-05-07T19:48:39.4697976Z #define ADJ_OFFSET 0x0001 2025-05-07T19:48:39.4698261Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:48:39.4698567Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:48:39.4698865Z #define ADJ_STATUS 0x0010 2025-05-07T19:48:39.4699116Z #define ADJ_TAI 0x0080 2025-05-07T19:48:39.4699369Z #define ADJ_TICK 0x4000 2025-05-07T19:48:39.4699635Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:48:39.4699925Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:48:39.4700259Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:48:39.4700598Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:48:39.4700921Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:48:39.4701265Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:48:39.4701596Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:48:39.4701867Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:48:39.4702139Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:48:39.4702413Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:48:39.4702702Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:48:39.4703140Z #define CHAR_MAX __SCHAR_MAX__ 2025-05-07T19:48:39.4703412Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:48:39.4703690Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:48:39.4703973Z #define CLOCK_BOOTTIME 7 2025-05-07T19:48:39.4704259Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:48:39.4704549Z #define CLOCK_MONOTONIC 1 2025-05-07T19:48:39.4704817Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:48:39.4705122Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:48:39.4705433Z #define CLOCK_PROCESS_CPUTIME_ID 2 
2025-05-07T19:48:39.4705739Z #define CLOCK_REALTIME 0 2025-05-07T19:48:39.4705999Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:48:39.4706293Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:48:39.4706565Z #define CLOCK_TAI 11 2025-05-07T19:48:39.4706834Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:48:39.4707124Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:48:39.4707407Z #define CUDARTAPI 2025-05-07T19:48:39.4707643Z #define CUDARTAPI_CDECL 2025-05-07T19:48:39.4707914Z #define CUDART_CB 2025-05-07T19:48:39.4708171Z #define CUDART_DEVICE __device__ 2025-05-07T19:48:39.4708456Z #define CUDART_VERSION 11080 2025-05-07T19:48:39.4708752Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:48:39.4709055Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:48:39.4709352Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:48:39.4709642Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:48:39.4709931Z #define DOMAIN 1 2025-05-07T19:48:39.4710158Z #define EOF (-1) 2025-05-07T19:48:39.4710409Z #define EXIT_FAILURE 1 2025-05-07T19:48:39.4710660Z #define EXIT_SUCCESS 0 2025-05-07T19:48:39.4710948Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:48:39.4711320Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:48:39.4711697Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:48:39.4712110Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:48:39.4712735Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:48:39.4713089Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:48:39.4713426Z #define FILENAME_MAX 4096 2025-05-07T19:48:39.4713790Z #define FOPEN_MAX 16 2025-05-07T19:48:39.4714062Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:48:39.4714414Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:48:39.4714755Z #define FP_INFINITE 1 2025-05-07T19:48:39.4715015Z #define FP_NAN 0 2025-05-07T19:48:39.4715288Z #define FP_NORMAL 4 2025-05-07T19:48:39.4715543Z #define FP_SUBNORMAL 3 2025-05-07T19:48:39.4716108Z #define FP_ZERO 2 
2025-05-07T19:48:39.4716476Z #define HOST_NAME_MAX 64 2025-05-07T19:48:39.4716779Z #define HUGE 3.40282347e+38F 2025-05-07T19:48:39.4717084Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:48:39.4717464Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:48:39.4717821Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:48:39.4718202Z #define INFINITY (__builtin_inff()) 2025-05-07T19:48:39.4718557Z #define INT_MAX __INT_MAX__ 2025-05-07T19:48:39.4718862Z #define INT_MIN (-__INT_MAX__ -1) 2025-05-07T19:48:39.4719192Z #define IOV_MAX 1024 2025-05-07T19:48:39.4719462Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:48:39.4719815Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:48:39.4720146Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:48:39.4720511Z #define LLONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:48:39.4720853Z #define LOGIN_NAME_MAX 256 2025-05-07T19:48:39.4721158Z #define LONG_BIT 64 2025-05-07T19:48:39.4721431Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:48:39.4721828Z #define LONG_LONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:48:39.4722211Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:48:39.4722521Z #define LONG_MIN (-__LONG_MAX__ -1L) 2025-05-07T19:48:39.4722864Z #define L_ctermid 9 2025-05-07T19:48:39.4723115Z #define L_cuserid 9 2025-05-07T19:48:39.4723391Z #define L_tmpnam 20 2025-05-07T19:48:39.4723650Z #define MATH_ERREXCEPT 2 2025-05-07T19:48:39.4723951Z #define MATH_ERRNO 1 2025-05-07T19:48:39.4724212Z #define MAX_CANON 255 2025-05-07T19:48:39.4724493Z #define MAX_INPUT 255 2025-05-07T19:48:39.4724789Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:48:39.4725160Z #define MB_LEN_MAX 16 2025-05-07T19:48:39.4725439Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:48:39.4725793Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:48:39.4726110Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:48:39.4726402Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:48:39.4726715Z #define MOD_MAXERROR ADJ_MAXERROR 
2025-05-07T19:48:39.4726997Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:48:39.4727283Z #define MOD_NANO ADJ_NANO 2025-05-07T19:48:39.4727543Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:48:39.4727828Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:48:39.4728094Z #define MOD_TAI ADJ_TAI 2025-05-07T19:48:39.4728364Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:48:39.4728651Z #define MQ_PRIO_MAX 32768 2025-05-07T19:48:39.4728920Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:48:39.4729260Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:48:39.4729598Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:48:39.4729935Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:48:39.4730275Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:48:39.4730640Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:48:39.4730992Z #define M_E 2.7182818284590452354 2025-05-07T19:48:39.4731311Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:48:39.4731636Z #define M_LN10 2.30258509299404568402 2025-05-07T19:48:39.4731987Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:48:39.4732334Z #define M_LN2 0.69314718055994530942 2025-05-07T19:48:39.4732652Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:48:39.4733005Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:48:39.4733345Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:48:39.4733713Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:48:39.4734045Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:48:39.4734407Z #define M_PI 3.14159265358979323846 2025-05-07T19:48:39.4734691Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:48:39.4735026Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:48:39.4735373Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:48:39.4735692Z #define M_PI_4l 0.785398163397448309615660845819875721L 
2025-05-07T19:48:39.4736072Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:48:39.4736585Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:48:39.4736950Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:48:39.4737301Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:48:39.4737648Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:48:39.4737984Z #define NAME_MAX 255 2025-05-07T19:48:39.4738251Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:48:39.4738551Z #define NFDBITS __NFDBITS 2025-05-07T19:48:39.4738805Z #define NGROUPS_MAX 65536 2025-05-07T19:48:39.4739083Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:48:39.4739374Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:48:39.4739676Z #define NL_MSGMAX INT_MAX 2025-05-07T19:48:39.4739926Z #define NL_NMAX INT_MAX 2025-05-07T19:48:39.4740184Z #define NL_SETMAX INT_MAX 2025-05-07T19:48:39.4740436Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:48:39.4740716Z #define NULL __null 2025-05-07T19:48:39.4740956Z #define NZERO 20 2025-05-07T19:48:39.4741177Z #define OVERFLOW 3 2025-05-07T19:48:39.4741429Z #define PATH_MAX 4096 2025-05-07T19:48:39.4741690Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:48:39.4741985Z #define PIPE_BUF 4096 2025-05-07T19:48:39.4742223Z #define PLOSS 6 2025-05-07T19:48:39.4742604Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:48:39.4743058Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:48:39.4743355Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:48:39.4743636Z #define P_tmpdir "/tmp" 2025-05-07T19:48:39.4743908Z #define RAND_MAX 2147483647 2025-05-07T19:48:39.4744188Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:48:39.4744447Z #define RTSIG_MAX 32 2025-05-07T19:48:39.4744711Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:48:39.4744995Z #define SCHAR_MIN (-__SCHAR_MAX__-1) 2025-05-07T19:48:39.4745315Z #define SEEK_CUR 1 2025-05-07T19:48:39.4745561Z #define SEEK_DATA 3 
2025-05-07T19:48:39.4745824Z #define SEEK_END 2 2025-05-07T19:48:39.4746048Z #define SEEK_HOLE 4 2025-05-07T19:48:39.4746286Z #define SEEK_SET 0 2025-05-07T19:48:39.4746529Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:48:39.4746837Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:48:39.4747138Z #define SHRT_MIN (-__SHRT_MAX__ -1) 2025-05-07T19:48:39.4747421Z #define SING 2 2025-05-07T19:48:39.4747663Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:48:39.4747914Z #define STA_CLK 0x8000 2025-05-07T19:48:39.4748176Z #define STA_CLOCKERR 0x1000 2025-05-07T19:48:39.4748432Z #define STA_DEL 0x0020 2025-05-07T19:48:39.4748681Z #define STA_FLL 0x0008 2025-05-07T19:48:39.4748921Z #define STA_FREQHOLD 0x0080 2025-05-07T19:48:39.4749190Z #define STA_INS 0x0010 2025-05-07T19:48:39.4749424Z #define STA_MODE 0x4000 2025-05-07T19:48:39.4749680Z #define STA_NANO 0x2000 2025-05-07T19:48:39.4749921Z #define STA_PLL 0x0001 2025-05-07T19:48:39.4750178Z #define STA_PPSERROR 0x0800 2025-05-07T19:48:39.4750459Z #define STA_PPSFREQ 0x0002 2025-05-07T19:48:39.4750721Z #define STA_PPSJITTER 0x0200 2025-05-07T19:48:39.4751007Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:48:39.4751279Z #define STA_PPSTIME 0x0004 2025-05-07T19:48:39.4751568Z #define STA_PPSWANDER 0x0400 2025-05-07T19:48:39.4752152Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:48:39.4752862Z #define STA_UNSYNC 0x0040 2025-05-07T19:48:39.4753117Z #define TIMER_ABSTIME 1 2025-05-07T19:48:39.4753444Z #define TIME_UTC 1 2025-05-07T19:48:39.4753664Z #define TLOSS 5 2025-05-07T19:48:39.4753897Z #define TMP_MAX 238328 2025-05-07T19:48:39.4754152Z #define TTY_NAME_MAX 32 2025-05-07T19:48:39.4754410Z #define UCHAR_MAX (__SCHAR_MAX__*2 +1) 2025-05-07T19:48:39.4754730Z #define UINT_MAX (__INT_MAX__ *2U +1U) 2025-05-07T19:48:39.4755058Z #define ULLONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:48:39.4755452Z #define ULONG_LONG_MAX 
(__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:48:39.4755808Z #define ULONG_MAX (__LONG_MAX__ *2UL+1UL) 2025-05-07T19:48:39.4756122Z #define UNDERFLOW 4 2025-05-07T19:48:39.4756367Z #define USHRT_MAX (__SHRT_MAX__ *2 +1) 2025-05-07T19:48:39.4756787Z #define WCONTINUED 8 2025-05-07T19:48:39.4757089Z #define WEXITED 4 2025-05-07T19:48:39.4757426Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:48:39.4757936Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:48:39.4758412Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:48:39.4758891Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:48:39.4759369Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:48:39.4759757Z #define WNOHANG 1 2025-05-07T19:48:39.4759985Z #define WNOWAIT 0x01000000 2025-05-07T19:48:39.4760249Z #define WORD_BIT 32 2025-05-07T19:48:39.4760473Z #define WSTOPPED 2 2025-05-07T19:48:39.4760785Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:48:39.4761228Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:48:39.4761584Z #define WUNTRACED 2 2025-05-07T19:48:39.4761834Z #define XATTR_LIST_MAX 65536 2025-05-07T19:48:39.4762108Z #define XATTR_NAME_MAX 255 2025-05-07T19:48:39.4762385Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:48:39.4762666Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:48:39.4762978Z #define _ACRTIMP 2025-05-07T19:48:39.4763199Z #define _ALLOCA_H 1 2025-05-07T19:48:39.4763433Z #define _ASSERT_H 1 2025-05-07T19:48:39.4763667Z #define _ATFILE_SOURCE 1 2025-05-07T19:48:39.4763934Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:48:39.4764208Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:48:39.4764478Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:48:39.4764766Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:48:39.4765139Z #define _BITS_TIMEX_H 1 2025-05-07T19:48:39.4765390Z #define _BITS_TIME_H 1 
2025-05-07T19:48:39.4765632Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:48:39.4765898Z #define _BITS_TYPES_H 1 2025-05-07T19:48:39.4766134Z #define _BSD_SOURCE 1 2025-05-07T19:48:39.4766384Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:48:39.4766640Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:48:39.4766909Z #define _CRTIMP 2025-05-07T19:48:39.4767148Z #define _ENDIAN_H 1 2025-05-07T19:48:39.4767380Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:48:39.4767668Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:48:39.4767937Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:48:39.4768195Z #define _FEATURES_H 1 2025-05-07T19:48:39.4768433Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:48:39.4768697Z #define _GCC_LIMITS_H_ 2025-05-07T19:48:39.4768980Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:48:39.4769466Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:39.4769915Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:48:39.4770222Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:48:39.4770525Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:48:39.4770809Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:48:39.4771112Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:48:39.4771405Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:48:39.4771729Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:48:39.4772074Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:48:39.4772557Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:39.4773002Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:48:39.4773303Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:48:39.4773581Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:48:39.4773907Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:39.4774242Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:48:39.4774539Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:48:39.4774841Z #define _GLIBCXX98_USE_C99_STDIO 1 
2025-05-07T19:48:39.4775128Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:48:39.4775430Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:48:39.4775804Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:48:39.4776220Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:48:39.4776526Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:48:39.4777057Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:48:39.4777395Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:48:39.4777775Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:48:39.4778168Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:48:39.4778602Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:48:39.4779081Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:48:39.4779396Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:48:39.4779700Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:48:39.4779969Z #define _GLIBCXX_CMATH 1 2025-05-07T19:48:39.4780269Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:48:39.4780625Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:48:39.4800399Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:48:39.4800734Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:48:39.4801026Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:48:39.4801328Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:48:39.4801708Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:48:39.4802040Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:48:39.4802372Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:48:39.4802709Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:48:39.4803049Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:48:39.4803454Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:48:39.4803894Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:48:39.4804609Z #define 
_GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:48:39.4805146Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:48:39.4805478Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:48:39.4805760Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:48:39.4806084Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:48:39.4806422Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:48:39.4806736Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:48:39.4807165Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:48:39.4807601Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:48:39.4807928Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:48:39.4808216Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:48:39.4808513Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:48:39.4808915Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:48:39.4809311Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:48:39.4809632Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:48:39.4809925Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:48:39.4810851Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template<typename _Tp, typename = __void_t<>> struct __has_##_NTYPE : false_type { }; template<typename _Tp> struct __has_##_NTYPE<_Tp, __void_t<typename _Tp::_NTYPE>> : true_type { }; 2025-05-07T19:48:39.4811773Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:48:39.4812067Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:48:39.4812358Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:48:39.4812656Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:48:39.4812959Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:48:39.4813227Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:48:39.4813527Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:48:39.4813841Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:48:39.4814124Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:48:39.4814390Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:48:39.4814671Z #define 
_GLIBCXX_HAVE_ATANL 1 2025-05-07T19:48:39.4814957Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:48:39.4815308Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:48:39.4815756Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:48:39.4816066Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:48:39.4816419Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:48:39.4816949Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:48:39.4817409Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:48:39.4817707Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:48:39.4818011Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:48:39.4818268Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:48:39.4818549Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:48:39.4818834Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:48:39.4819084Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:48:39.4819355Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:48:39.4819610Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:48:39.4819882Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:48:39.4820147Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:48:39.4820425Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:48:39.4820723Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:48:39.4821064Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:48:39.4821336Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:48:39.4821605Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:48:39.4821874Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:48:39.4822130Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:48:39.4822402Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:48:39.4822659Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:48:39.4822931Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:48:39.4823187Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:48:39.4823462Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:48:39.4823720Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:48:39.4823990Z #define 
_GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:48:39.4824246Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:48:39.4824517Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:48:39.4824787Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:48:39.4825041Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:48:39.4825315Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:48:39.4825574Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:48:39.4825859Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:48:39.4826112Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:48:39.4826389Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:48:39.4826647Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:48:39.4826915Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:48:39.4827168Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:48:39.4827445Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:48:39.4827739Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:48:39.4828012Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:48:39.4828285Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:48:39.4828538Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:48:39.4828806Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:48:39.4829058Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:48:39.4829330Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:48:39.4829589Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:48:39.4829874Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:48:39.4830150Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:48:39.4830424Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:48:39.4830702Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:48:39.4830978Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:48:39.4831272Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:48:39.4831555Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:48:39.4831836Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:48:39.4832105Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:48:39.4832469Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:48:39.4832927Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 
2025-05-07T19:48:39.4833263Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:48:39.4833612Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:48:39.4833918Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:48:39.4834213Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:48:39.4834488Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:48:39.4834775Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:48:39.4835050Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:48:39.4835357Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:48:39.4835646Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:48:39.4839495Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:48:39.4839817Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:48:39.4840102Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:48:39.4840376Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:48:39.4840683Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:48:39.4841009Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:48:39.4841318Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:48:39.4841644Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:48:39.4841948Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:48:39.4842239Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:48:39.4842527Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:48:39.4842862Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:48:39.4843173Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:48:39.4843459Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:48:39.4843739Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:48:39.4844035Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:48:39.4844336Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:48:39.4844611Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:48:39.4844900Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:48:39.4845284Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:48:39.4845563Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:48:39.4845825Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:48:39.4846104Z #define _GLIBCXX_HAVE_SINHL 1 
2025-05-07T19:48:39.4846382Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:48:39.4846649Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:48:39.4846944Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:48:39.4847205Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:48:39.4847491Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:48:39.4847781Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:48:39.4848076Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:48:39.4848350Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:48:39.4848645Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:48:39.4848932Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:48:39.4849238Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:48:39.4849534Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:48:39.4849806Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:48:39.4850090Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:48:39.4850386Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:48:39.4850719Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:48:39.4851000Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:48:39.4851361Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:48:39.4851747Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:48:39.4852051Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:48:39.4852348Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:48:39.4852643Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:48:39.4852952Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:48:39.4853239Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:48:39.4853547Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:48:39.4853841Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:48:39.4854152Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:48:39.4854449Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:48:39.4854849Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:48:39.4855124Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:48:39.4855409Z #define _GLIBCXX_HAVE_S_ISREG 1 
2025-05-07T19:48:39.4855679Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:48:39.4855930Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:48:39.4856198Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:48:39.4856451Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:48:39.4856722Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:48:39.4856983Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:48:39.4857246Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:48:39.4857509Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:48:39.4857789Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:48:39.4858057Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:48:39.4858331Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:48:39.4858767Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:48:39.4859033Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:48:39.4859304Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:48:39.4859562Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:48:39.4859834Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:48:39.4860096Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:48:39.4860374Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:48:39.4860639Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:48:39.4860894Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:48:39.4861150Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:48:39.4861433Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:48:39.4861930Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:48:39.4862534Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:48:39.4862956Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:48:39.4863215Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:48:39.4863503Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:48:39.4863876Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:48:39.4864373Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:48:39.4864825Z #define 
_GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:48:39.4865126Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:48:39.4865495Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:48:39.4866033Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:48:39.4866527Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:48:39.4866834Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:48:39.4867169Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:48:39.4867519Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:48:39.4867847Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:48:39.4868220Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:48:39.4868588Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:48:39.4869004Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:48:39.4869393Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:48:39.4869678Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:48:39.4869944Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:48:39.4870274Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:48:39.4870671Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:48:39.4871081Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:48:39.4871401Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:48:39.4871727Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:48:39.4872099Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:48:39.4872468Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:48:39.4872971Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:48:39.4873317Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:48:39.4873606Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:48:39.4873882Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:48:39.4874182Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:48:39.4874464Z 
#define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:48:39.4874773Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:48:39.4875075Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:48:39.4875339Z #define _GLIBCXX_STD_A std 2025-05-07T19:48:39.4875613Z #define _GLIBCXX_STD_C std 2025-05-07T19:48:39.4875870Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:48:39.4876141Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:48:39.4876457Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:48:39.4876867Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:48:39.4877218Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:48:39.4877547Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:48:39.4877927Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:48:39.4878444Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:48:39.4878786Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:48:39.4879102Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:48:39.4879428Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:48:39.4879728Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:48:39.4880082Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:48:39.4880431Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:48:39.4880789Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:48:39.4881108Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:48:39.4881442Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:48:39.4881787Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:48:39.4882113Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:48:39.4882390Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:48:39.4882664Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:48:39.4882959Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:48:39.4883256Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:48:39.4883607Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:48:39.4883990Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 
2025-05-07T19:48:39.4884309Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:48:39.4884621Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:48:39.4884920Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:48:39.4885268Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:48:39.4885645Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:48:39.4886198Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:48:39.4886502Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:48:39.4886867Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:48:39.4887281Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:48:39.4887713Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:48:39.4888093Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:48:39.4888416Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:48:39.4888751Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:48:39.4889071Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:48:39.4889376Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:48:39.4889679Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:48:39.4889988Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:48:39.4890274Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:48:39.4890562Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:48:39.4890838Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:48:39.4891132Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:48:39.4891426Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:48:39.4891727Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:48:39.4892027Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:48:39.4892298Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:48:39.4892578Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:48:39.4892860Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:48:39.4893146Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:48:39.4893448Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:48:39.4893777Z #define 
_GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:48:39.4894086Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:48:39.4894388Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:48:39.4894686Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:48:39.4894994Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:48:39.4895327Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:48:39.4895610Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:48:39.4895916Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:48:39.4896285Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include(<tbb/tbb.h>) 2025-05-07T19:48:39.4896689Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:48:39.4896968Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:48:39.4897260Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:48:39.4897544Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:48:39.4897850Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:48:39.4898403Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:48:39.4898762Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:48:39.4899217Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:48:39.4899619Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:48:39.4899887Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:48:39.4900151Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:48:39.4900408Z #define _GNU_SOURCE 1 2025-05-07T19:48:39.4900643Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:48:39.4900932Z #define _G_BUFSIZ 8192 2025-05-07T19:48:39.4901412Z #define _G_HAVE_MMAP 1 2025-05-07T19:48:39.4901651Z #define _G_HAVE_MREMAP 1 2025-05-07T19:48:39.4901942Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:48:39.4902300Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:48:39.4902568Z #define _G_config_h 1 2025-05-07T19:48:39.4902811Z #define _G_va_list __gnuc_va_list 2025-05-07T19:48:39.4903089Z #define _INITIALIZER_LIST 2025-05-07T19:48:39.4903323Z #define _IOFBF 0 2025-05-07T19:48:39.4903548Z #define _IOLBF 1 2025-05-07T19:48:39.4903755Z #define _IONBF 2 
2025-05-07T19:48:39.4903979Z #define _IOS_APPEND 8 2025-05-07T19:48:39.4904200Z #define _IOS_ATEND 4 2025-05-07T19:48:39.4904425Z #define _IOS_BIN 128 2025-05-07T19:48:39.4904636Z #define _IOS_INPUT 1 2025-05-07T19:48:39.4904871Z #define _IOS_NOCREATE 32 2025-05-07T19:48:39.4905104Z #define _IOS_NOREPLACE 64 2025-05-07T19:48:39.4905347Z #define _IOS_OUTPUT 2 2025-05-07T19:48:39.4905564Z #define _IOS_TRUNC 16 2025-05-07T19:48:39.4905801Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:48:39.4906114Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:48:39.4906446Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:48:39.4906720Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:48:39.4906979Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:48:39.4907258Z #define _IO_DEC 020 2025-05-07T19:48:39.4907479Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:48:39.4907762Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:48:39.4908010Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:48:39.4908264Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:48:39.4908499Z #define _IO_FIXED 010000 2025-05-07T19:48:39.4908744Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:48:39.4908996Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:48:39.4909248Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:48:39.4909542Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:48:39.4909836Z #define _IO_HEX 0100 2025-05-07T19:48:39.4910071Z #define _IO_INTERNAL 010 2025-05-07T19:48:39.4910305Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:48:39.4910569Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:48:39.4910828Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:48:39.4911084Z #define _IO_LEFT 02 2025-05-07T19:48:39.4911296Z #define _IO_LINE_BUF 0x200 2025-05-07T19:48:39.4911551Z #define _IO_LINKED 0x80 2025-05-07T19:48:39.4911778Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:48:39.4912046Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:48:39.4912383Z #define _IO_NO_READS 4 2025-05-07T19:48:39.4912622Z #define _IO_NO_WRITES 8 
2025-05-07T19:48:39.4913037Z #define _IO_OCT 040 2025-05-07T19:48:39.4913421Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:48:39.4913884Z #define _IO_RIGHT 04 2025-05-07T19:48:39.4914123Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:48:39.4914402Z #define _IO_SHOWBASE 0200 2025-05-07T19:48:39.4914660Z #define _IO_SHOWPOINT 0400 2025-05-07T19:48:39.4914931Z #define _IO_SHOWPOS 02000 2025-05-07T19:48:39.4915178Z #define _IO_SKIPWS 01 2025-05-07T19:48:39.4915430Z #define _IO_STDIO 040000 2025-05-07T19:48:39.4915685Z #define _IO_STDIO_H 2025-05-07T19:48:39.4915932Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:48:39.4916209Z #define _IO_UNBUFFERED 2 2025-05-07T19:48:39.4916467Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:48:39.4916751Z #define _IO_UNITBUF 020000 2025-05-07T19:48:39.4917009Z #define _IO_UPPERCASE 01000 2025-05-07T19:48:39.4917271Z #define _IO_USER_BUF 1 2025-05-07T19:48:39.4917514Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:48:39.4917955Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:48:39.4918276Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:48:39.4918698Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:48:39.4919203Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:48:39.4919629Z #define _IO_file_flags _flags 2025-05-07T19:48:39.4919912Z #define _IO_flockfile(_fp) 2025-05-07T19:48:39.4920184Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:48:39.4920476Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:48:39.4920747Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:48:39.4921042Z #define _IO_funlockfile(_fp) 2025-05-07T19:48:39.4921622Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? 
__uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:48:39.4922219Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:48:39.4922487Z #define _IO_off64_t __off64_t 2025-05-07T19:48:39.4922759Z #define _IO_off_t __off_t 2025-05-07T19:48:39.4923061Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:48:39.4923719Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:48:39.4924357Z #define _IO_pid_t __pid_t 2025-05-07T19:48:39.4925119Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:48:39.4925779Z #define _IO_size_t size_t 2025-05-07T19:48:39.4926016Z #define _IO_ssize_t __ssize_t 2025-05-07T19:48:39.4926306Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:48:39.4926649Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:48:39.4926974Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:48:39.4927287Z #define _IO_uid_t __uid_t 2025-05-07T19:48:39.4927524Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:48:39.4927800Z #define _IO_wint_t wint_t 2025-05-07T19:48:39.4928028Z #define _ISOC11_SOURCE 1 2025-05-07T19:48:39.4928259Z #define _ISOC95_SOURCE 1 2025-05-07T19:48:39.4928481Z #define _ISOC99_SOURCE 1 2025-05-07T19:48:39.4928716Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:48:39.4928980Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:48:39.4929215Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:48:39.4929462Z #define _LINUX_LIMITS_H 2025-05-07T19:48:39.4929680Z #define _LP64 1 2025-05-07T19:48:39.4929888Z #define _MATH_H 1 2025-05-07T19:48:39.4930092Z #define _MATH_H_MATHDEF 1 2025-05-07T19:48:39.4930315Z #define _MOVE_H 1 2025-05-07T19:48:39.4930514Z #define _Mfloat_ float 2025-05-07T19:48:39.4930754Z #define _Mlong_double_ long double 
2025-05-07T19:48:39.4931007Z #define _NEW 2025-05-07T19:48:39.4931219Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:48:39.4931492Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:48:39.4931753Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:48:39.4932017Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:48:39.4932277Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:48:39.4932563Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:48:39.4932841Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:48:39.4933120Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:48:39.4933378Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:48:39.4933635Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:48:39.4933888Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:48:39.4934146Z #define _POSIX_AIO_MAX 1 2025-05-07T19:48:39.4934379Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:48:39.4934622Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:48:39.4934877Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:48:39.4935150Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:48:39.4935414Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:48:39.4935688Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:48:39.4935990Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:48:39.4936266Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:48:39.4936607Z #define _POSIX_LINK_MAX 8 2025-05-07T19:48:39.4936919Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:48:39.4937175Z #define _POSIX_MAX_CANON 255 2025-05-07T19:48:39.4937423Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:48:39.4937675Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:48:39.4937935Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:48:39.4938180Z #define _POSIX_NAME_MAX 14 2025-05-07T19:48:39.4938432Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:48:39.4938679Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:48:39.4938924Z #define _POSIX_PATH_MAX 256 2025-05-07T19:48:39.4939164Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:48:39.4939406Z #define _POSIX_QLIMIT 1 2025-05-07T19:48:39.4939634Z 
#define _POSIX_RE_DUP_MAX 255 2025-05-07T19:48:39.4939886Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:48:39.4940127Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:48:39.4940396Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:48:39.4940672Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:48:39.4940916Z #define _POSIX_SOURCE 1 2025-05-07T19:48:39.4941162Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:48:39.4941411Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:48:39.4941665Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:48:39.4941915Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:48:39.4942197Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:48:39.4942505Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:48:39.4942787Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:48:39.4943062Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:48:39.4943311Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:48:39.4943570Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:48:39.4943979Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:48:39.4944323Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:48:39.4944813Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:48:39.4945432Z #define _PSTL_CLANG_VERSION (__clang_major__ * 10000 + __clang_minor__ * 100 + __clang_patchlevel__) 2025-05-07T19:48:39.4945916Z #define _PSTL_CONFIG_H 2025-05-07T19:48:39.4946389Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:48:39.4947265Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:48:39.4948070Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:48:39.4948873Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 
2025-05-07T19:48:39.4949847Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:48:39.4950605Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:48:39.4951069Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:48:39.4951574Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:48:39.4952044Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:48:39.4952566Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:48:39.4952950Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:48:39.4953392Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:48:39.4953783Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:48:39.4954075Z #define _PSTL_PRAGMA(x) _Pragma(# x) 2025-05-07T19:48:39.4954765Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:48:39.4955548Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:48:39.4955946Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:48:39.4956313Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:48:39.4956679Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:48:39.4957337Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:48:39.4957897Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:48:39.4958264Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:48:39.4958613Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:48:39.4958937Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:48:39.4959302Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:48:39.4959664Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:48:39.4960092Z 
#define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:48:39.4960611Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:48:39.4961083Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:48:39.4961387Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:48:39.4961727Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:48:39.4962057Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:48:39.4962351Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:48:39.4962665Z #define _PSTL_UDR_PRESENT 0 2025-05-07T19:48:39.4963117Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:48:39.4963626Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:48:39.4963940Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:48:39.4964292Z #define _PSTL_VERSION 12000 2025-05-07T19:48:39.4964600Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:48:39.4965019Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:48:39.4965431Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:48:39.4965764Z #define _PTRDIFF_T 2025-05-07T19:48:39.4966012Z #define _PTR_TRAITS_H 1 2025-05-07T19:48:39.4966259Z #define _SIGSET_H_types 1 2025-05-07T19:48:39.4966615Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:48:39.4966990Z #define _SIZE_T 2025-05-07T19:48:39.4967233Z #define _STDC_PREDEF_H 1 2025-05-07T19:48:39.4967482Z #define _STDIO_H 1 2025-05-07T19:48:39.4967732Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:48:39.4967996Z #define _STDLIB_H 1 2025-05-07T19:48:39.4968242Z #define _STL_ALGOBASE_H 1 2025-05-07T19:48:39.4968528Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:48:39.4968890Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:48:39.4969196Z #define _STL_ITERATOR_H 1 2025-05-07T19:48:39.4969448Z #define _STL_PAIR_H 1 2025-05-07T19:48:39.4969700Z #define _STL_RELOPS_H 1 
2025-05-07T19:48:39.4969936Z #define _STRING_H 1 2025-05-07T19:48:39.4970183Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:48:39.4970432Z #define _SVID_SOURCE 1 2025-05-07T19:48:39.4970680Z #define _SYS_CDEFS_H 1 2025-05-07T19:48:39.4970918Z #define _SYS_SELECT_H 1 2025-05-07T19:48:39.4971176Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:48:39.4971444Z #define _SYS_TYPES_H 1 2025-05-07T19:48:39.4971676Z #define _TIME_H 1 2025-05-07T19:48:39.4971916Z #define _VA_LIST_DEFINED 2025-05-07T19:48:39.4972165Z #define _XLOCALE_H 1 2025-05-07T19:48:39.4972438Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:48:39.4972940Z #define _XOPEN_LIM_H 1 2025-05-07T19:48:39.4973210Z #define _XOPEN_SOURCE 700 2025-05-07T19:48:39.4973478Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:48:39.4973871Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:48:39.4974334Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:48:39.4974743Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:48:39.4975112Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:48:39.4975426Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:48:39.4975696Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:48:39.4975950Z #define __ATOMIC_CONSUME 1 2025-05-07T19:48:39.4976215Z #define __ATOMIC_RELAXED 0 2025-05-07T19:48:39.4976465Z #define __ATOMIC_RELEASE 3 2025-05-07T19:48:39.4976729Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:48:39.4976991Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:48:39.4977387Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:48:39.4977719Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:48:39.4978039Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:48:39.4978331Z #define __BIG_ENDIAN 4321 2025-05-07T19:48:39.4978594Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:48:39.4978901Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:48:39.4979186Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:48:39.4979528Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 
2025-05-07T19:48:39.4979875Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:39.4980211Z #define __BOOL_WIDTH__ 8 2025-05-07T19:48:39.4980477Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:48:39.4980810Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:48:39.4981142Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:48:39.4981457Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:48:39.4981776Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:48:39.4982060Z #define __CHAR_BIT__ 8 2025-05-07T19:48:39.4982332Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:48:39.4982662Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:48:39.4983008Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:48:39.4983340Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:48:39.4983669Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:48:39.4983982Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:48:39.4984323Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:48:39.4984666Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:48:39.4984995Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:48:39.4985334Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:48:39.4985646Z #define __CLANG_LIMITS_H 2025-05-07T19:48:39.4986074Z #define __CLANG_MAX_ALIGN_T_DEFINED 2025-05-07T19:48:39.4986381Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:48:39.4986714Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:39.4987034Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:48:39.4987326Z #define __COMPAR_FN_T 2025-05-07T19:48:39.4987584Z #define __CONCAT(x,y) x ## y 2025-05-07T19:48:39.4987880Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:48:39.4988182Z #define __CUDACC_VER_BUILD__ 89 2025-05-07T19:48:39.4988462Z #define __CUDACC_VER_MAJOR__ 11 2025-05-07T19:48:39.4988758Z #define __CUDACC_VER_MINOR__ 8 2025-05-07T19:48:39.4989389Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. 
Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:48:39.4990055Z #define __CUDACC__ 1 2025-05-07T19:48:39.4990311Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:48:39.4990636Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:48:39.4991108Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:48:39.4991620Z #define __CUDA_API_VER_MAJOR__ 11 2025-05-07T19:48:39.4991924Z #define __CUDA_API_VER_MINOR__ 8 2025-05-07T19:48:39.4992209Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:48:39.4992554Z #define __CUDA_ARCH__ 520 2025-05-07T19:48:39.4992832Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:48:39.4993149Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:48:39.4993414Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:48:39.4993692Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:48:39.4993960Z #define __CUDA_SURFACE_TYPES_H__ 2025-05-07T19:48:39.4994263Z #define __CUDA_TEXTURE_TYPES_H__ 2025-05-07T19:48:39.4994565Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:48:39.4994847Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:48:39.4995161Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:48:39.4995486Z #define __DBL_DIG__ 15 2025-05-07T19:48:39.4995769Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:48:39.4996087Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:48:39.4996360Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:48:39.4996624Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:48:39.4996893Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:48:39.4997146Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:48:39.4997622Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:48:39.4997902Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:48:39.4998196Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:48:39.4998491Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:48:39.4998770Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:48:39.4999115Z #define __DECIMAL_DIG__ 
__LDBL_DECIMAL_DIG__ 2025-05-07T19:48:39.4999430Z #define __DELETE_THROW throw() 2025-05-07T19:48:39.4999713Z #define __DEPRECATED 1 2025-05-07T19:48:39.4999977Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:39.5000311Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:39.5000627Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:48:39.5000959Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:48:39.5001274Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:48:39.5001562Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:48:39.5001850Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:48:39.5002144Z #define __DEVICE_TYPES_H__ 2025-05-07T19:48:39.5002425Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:39.5002707Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:48:39.5002991Z #define __DRIVER_TYPES_H__ 2025-05-07T19:48:39.5003239Z #define __ELF__ 1 2025-05-07T19:48:39.5003472Z #define __END_DECLS } 2025-05-07T19:48:39.5003710Z #define __END_NAMESPACE_C99 2025-05-07T19:48:39.5003992Z #define __END_NAMESPACE_STD 2025-05-07T19:48:39.5004261Z #define __EXCEPTIONS 1 2025-05-07T19:48:39.5004504Z #define __EXCEPTION_H 1 2025-05-07T19:48:39.5004761Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:48:39.5005182Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:48:39.5005618Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:48:39.5006014Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:48:39.5006496Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:48:39.5006950Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:48:39.5007374Z #define __FD_SETSIZE 1024 2025-05-07T19:48:39.5008089Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : 
"memory"); } while (0) 2025-05-07T19:48:39.5008952Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:48:39.5009226Z #define __FILE_defined 1 2025-05-07T19:48:39.5009476Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:48:39.5009745Z #define __FLOAT128__ 1 2025-05-07T19:48:39.5009987Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:48:39.5010290Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:48:39.5010595Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:48:39.5010906Z #define __FLT16_DIG__ 3 2025-05-07T19:48:39.5011167Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:48:39.5011457Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:48:39.5011733Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:48:39.5012014Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:48:39.5012295Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:48:39.5012548Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:48:39.5012807Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:48:39.5013068Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:48:39.5013349Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:48:39.5013728Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:48:39.5013976Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:48:39.5014260Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:48:39.5014515Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:48:39.5014797Z #define __FLT_DIG__ 6 2025-05-07T19:48:39.5015026Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:48:39.5015311Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:48:39.5015557Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:48:39.5015822Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:48:39.5016070Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:48:39.5016387Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:48:39.5016719Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:48:39.5016957Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:48:39.5017226Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:48:39.5017477Z #define __FLT_MIN_EXP__ (-125) 
2025-05-07T19:48:39.5017737Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:48:39.5017987Z #define __FLT_RADIX__ 2 2025-05-07T19:48:39.5018242Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:39.5018557Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:39.5018916Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:39.5019258Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:39.5019627Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:48:39.5019996Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:39.5020305Z #define __FXSR__ 1 2025-05-07T19:48:39.5020579Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:48:39.5020871Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:48:39.5021213Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:48:39.5021527Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:48:39.5021855Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:48:39.5022155Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:48:39.5022471Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:48:39.5022788Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:48:39.5023092Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:48:39.5023420Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:48:39.5023733Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:48:39.5024073Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:48:39.5024382Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:48:39.5024707Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:48:39.5025040Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:48:39.5025386Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:48:39.5025744Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:48:39.5026059Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:48:39.5026371Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:48:39.5026681Z #define __GLIBCXX_TYPE_INT_N_0 __int128 
2025-05-07T19:48:39.5027002Z #define __GLIBCXX__ 20230528 2025-05-07T19:48:39.5027277Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:48:39.5027579Z #define __GLIBC_MINOR__ 17 2025-05-07T19:48:39.5027989Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:48:39.5028457Z #define __GLIBC__ 2 2025-05-07T19:48:39.5028704Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:48:39.5028995Z #define __GNUC_MINOR__ 2 2025-05-07T19:48:39.5029284Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:48:39.5029687Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:48:39.5030143Z #define __GNUC_VA_LIST 2025-05-07T19:48:39.5030381Z #define __GNUC__ 4 2025-05-07T19:48:39.5030634Z #define __GNUG__ 4 2025-05-07T19:48:39.5030865Z #define __GNU_LIBRARY__ 6 2025-05-07T19:48:39.5031148Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:48:39.5031436Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:48:39.5031746Z #define __GXX_RTTI 1 2025-05-07T19:48:39.5031988Z #define __GXX_WEAK__ 1 2025-05-07T19:48:39.5032234Z #define __HAVE_COLUMN 2025-05-07T19:48:39.5032553Z #define __HOST_CONFIG_H__ 2025-05-07T19:48:39.5032965Z #define __HOST_DEFINES_H__ 2025-05-07T19:48:39.5033246Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:48:39.5033519Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:39.5033835Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:48:39.5034133Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:39.5034451Z #define __INT16_C_SUFFIX__ 2025-05-07T19:48:39.5034704Z #define __INT16_FMTd__ "hd" 2025-05-07T19:48:39.5034969Z #define __INT16_FMTi__ "hi" 2025-05-07T19:48:39.5035220Z #define __INT16_MAX__ 32767 2025-05-07T19:48:39.5035483Z #define __INT16_TYPE__ short 2025-05-07T19:48:39.5035755Z #define __INT32_C_SUFFIX__ 2025-05-07T19:48:39.5036164Z #define __INT32_FMTd__ "d" 2025-05-07T19:48:39.5036431Z #define __INT32_FMTi__ "i" 2025-05-07T19:48:39.5036683Z #define __INT32_MAX__ 
2147483647 2025-05-07T19:48:39.5036961Z #define __INT32_TYPE__ int 2025-05-07T19:48:39.5037211Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:48:39.5037481Z #define __INT64_FMTd__ "ld" 2025-05-07T19:48:39.5037736Z #define __INT64_FMTi__ "li" 2025-05-07T19:48:39.5038012Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:48:39.5038312Z #define __INT64_TYPE__ long int 2025-05-07T19:48:39.5038587Z #define __INT8_C_SUFFIX__ 2025-05-07T19:48:39.5038845Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:48:39.5039098Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:48:39.5039359Z #define __INT8_MAX__ 127 2025-05-07T19:48:39.5039610Z #define __INT8_TYPE__ signed char 2025-05-07T19:48:39.5039904Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:48:39.5040173Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:48:39.5040446Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:48:39.5040731Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:48:39.5041048Z #define __INTMAX_TYPE__ long int 2025-05-07T19:48:39.5041325Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:48:39.5041595Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:48:39.5041877Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:48:39.5042156Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:48:39.5042483Z #define __INTPTR_TYPE__ long int 2025-05-07T19:48:39.5042765Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:48:39.5043039Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:48:39.5043316Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:48:39.5043600Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:48:39.5043878Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:48:39.5043984Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:48:39.5044078Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:48:39.5044173Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:48:39.5044277Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:48:39.5044384Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:48:39.5044486Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:48:39.5044585Z #define 
__INT_FAST64_FMTd__ "ld" 2025-05-07T19:48:39.5044691Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:48:39.5044820Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:48:39.5045027Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:48:39.5045133Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:48:39.5045224Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:48:39.5045315Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:48:39.5045408Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:48:39.5045525Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:48:39.5045617Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:48:39.5045714Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:48:39.5045826Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:48:39.5045919Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:48:39.5046018Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:48:39.5046110Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:48:39.5046226Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:48:39.5046320Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:48:39.5046417Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:48:39.5046525Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:48:39.5046619Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:48:39.5046714Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:48:39.5046808Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:48:39.5046945Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:48:39.5047045Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:48:39.5047139Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:48:39.5047247Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:48:39.5047341Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:48:39.5047435Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:48:39.5047538Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:48:39.5047648Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:48:39.5047739Z #define __INT_MAX__ 2147483647 2025-05-07T19:48:39.5048107Z #define __INT_WIDTH__ 32 
2025-05-07T19:48:39.5048222Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:48:39.5048321Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:48:39.5048452Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:48:39.5048602Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:48:39.5048712Z #define __LDBL_DIG__ 18 2025-05-07T19:48:39.5048846Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:48:39.5048945Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:48:39.5049060Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:48:39.5049160Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:48:39.5049256Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:48:39.5049355Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:48:39.5049470Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:48:39.5049594Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:48:39.5049694Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:48:39.5049809Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:48:39.5049939Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:48:39.5050059Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:48:39.5050216Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:48:39.5050395Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:48:39.5050509Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:48:39.5050658Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:48:39.5050755Z #define __LEAF 2025-05-07T19:48:39.5050840Z #define __LEAF_ATTR 2025-05-07T19:48:39.5050937Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:48:39.5051048Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:48:39.5051140Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:48:39.5051232Z #define __LLONG_WIDTH__ 64 2025-05-07T19:48:39.5051350Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:48:39.5051468Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:48:39.5051570Z #define __LONG_MAX__ 9223372036854775807L 
2025-05-07T19:48:39.5051667Z #define __LONG_WIDTH__ 64 2025-05-07T19:48:39.5051765Z #define __LP64__ 1 2025-05-07T19:48:39.5052268Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:48:39.5052954Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:48:39.5053071Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:48:39.5053173Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:48:39.5053272Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:48:39.5053357Z #define __MMX__ 1 2025-05-07T19:48:39.5053473Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:48:39.5053567Z #define __N(msgid) (msgid) 2025-05-07T19:48:39.5053695Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:48:39.5053832Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:39.5053919Z #define __NO_CTYPE 1 2025-05-07T19:48:39.5054014Z #define __NO_INLINE__ 1 2025-05-07T19:48:39.5054114Z #define __NO_MATH_INLINES 1 2025-05-07T19:48:39.5054243Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:48:39.5054357Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:48:39.5054445Z #define __NVCC__ 1 2025-05-07T19:48:39.5054566Z #define __NV_GLIBCXX_VERSION 40800 2025-05-07T19:48:39.5054673Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:48:39.5054775Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:48:39.5054879Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:48:39.5054997Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:48:39.5055113Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:39.5055246Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:48:39.5055370Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:48:39.5055482Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:48:39.5055598Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 
2025-05-07T19:48:39.5055711Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:48:39.5055934Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:48:39.5056040Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:48:39.5056141Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:48:39.5056246Z #define __P(args) args 2025-05-07T19:48:39.5056370Z #define __PDP_ENDIAN 3412 2025-05-07T19:48:39.5056457Z #define __PIC__ 2 2025-05-07T19:48:39.5056557Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:48:39.5056661Z #define __PIE__ 2 2025-05-07T19:48:39.5056752Z #define __PMT(args) args 2025-05-07T19:48:39.5056850Z #define __POINTER_WIDTH__ 64 2025-05-07T19:48:39.5056970Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:48:39.5057075Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:48:39.5057194Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:48:39.5057296Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:48:39.5057414Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:48:39.5057514Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:48:39.5057629Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:48:39.5057752Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:48:39.5057851Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:48:39.5058087Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:39.5058325Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:48:39.5058593Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:39.5058877Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:39.5059126Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:48:39.5059239Z #define __REGISTER_PREFIX__ 2025-05-07T19:48:39.5059343Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:39.5059458Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:39.5059568Z 
#define __S16_TYPE short int 2025-05-07T19:48:39.5059657Z #define __S32_TYPE int 2025-05-07T19:48:39.5059755Z #define __S64_TYPE long int 2025-05-07T19:48:39.5059852Z #define __SCHAR_MAX__ 127 2025-05-07T19:48:39.5059952Z #define __SEG_FS 1 2025-05-07T19:48:39.5060038Z #define __SEG_GS 1 2025-05-07T19:48:39.5060131Z #define __SHRT_MAX__ 32767 2025-05-07T19:48:39.5060239Z #define __SHRT_WIDTH__ 16 2025-05-07T19:48:39.5060342Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:48:39.5060442Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:48:39.5060590Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:48:39.5060705Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:48:39.5060800Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:48:39.5060895Z #define __SIZEOF_INT128__ 16 2025-05-07T19:48:39.5061003Z #define __SIZEOF_INT__ 4 2025-05-07T19:48:39.5061104Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:48:39.5061204Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:48:39.5061297Z #define __SIZEOF_LONG__ 8 2025-05-07T19:48:39.5061410Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:48:39.5061513Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:48:39.5061635Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:48:39.5061756Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:48:39.5061864Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:48:39.5061967Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:48:39.5062080Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:48:39.5062199Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:48:39.5062309Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:48:39.5062415Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:48:39.5062530Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:48:39.5062624Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:48:39.5062718Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:48:39.5062815Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:48:39.5062924Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:48:39.5063017Z #define __SIZE_FMTX__ "lX" 
2025-05-07T19:48:39.5063111Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:48:39.5063219Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:48:39.5063312Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:48:39.5064190Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:48:39.5064298Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:48:39.5064407Z #define __SIZE_WIDTH__ 64 2025-05-07T19:48:39.5064501Z #define __SLONG32_TYPE int 2025-05-07T19:48:39.5064605Z #define __SLONGWORD_TYPE long int 2025-05-07T19:48:39.5064729Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:39.5064834Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:39.5064934Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:48:39.5065048Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:48:39.5065147Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:48:39.5065244Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:48:39.5065350Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:39.5065470Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:39.5065572Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:48:39.5065670Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:48:39.5065787Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:39.5065893Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:48:39.5065999Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:39.5066126Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:39.5066244Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:48:39.5066342Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:48:39.5066435Z #define __SM_70_RT_HPP__ 2025-05-07T19:48:39.5066539Z #define __SM_70_RT_H__ 2025-05-07T19:48:39.5066634Z #define __SM_80_RT_HPP__ 2025-05-07T19:48:39.5066722Z #define __SM_80_RT_H__ 2025-05-07T19:48:39.5066816Z #define __SM_90_RT_HPP__ 2025-05-07T19:48:39.5066923Z #define __SM_90_RT_H__ 2025-05-07T19:48:39.5067023Z #define __SQUAD_TYPE long int 2025-05-07T19:48:39.5067115Z #define __SSE2_MATH__ 1 2025-05-07T19:48:39.5067215Z #define __SSE2__ 1 
2025-05-07T19:48:39.5067493Z #define __SSE_MATH__ 1 2025-05-07T19:48:39.5067594Z #define __SSE__ 1 2025-05-07T19:48:39.5067698Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:48:39.5067844Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:48:39.5067971Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:48:39.5068072Z #define __STDCPP_THREADS__ 1 2025-05-07T19:48:39.5068182Z #define __STDC_HOSTED__ 1 2025-05-07T19:48:39.5068287Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:48:39.5068381Z #define __STDC_IEC_559__ 1 2025-05-07T19:48:39.5068479Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:48:39.5068594Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:48:39.5068687Z #define __STDC_UTF_16__ 1 2025-05-07T19:48:39.5068779Z #define __STDC_UTF_32__ 1 2025-05-07T19:48:39.5068880Z #define __STDC__ 1 2025-05-07T19:48:39.5068967Z #define __STDDEF_H 2025-05-07T19:48:39.5069058Z #define __STRING(x) #x 2025-05-07T19:48:39.5069162Z #define __SURFACE_FUNCTIONS_H__ 2025-05-07T19:48:39.5069293Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:48:39.5069393Z #define __SURFACE_TYPES_H__ 2025-05-07T19:48:39.5069526Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:39.5069641Z #define __SWORD_TYPE long int 2025-05-07T19:48:39.5069773Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:48:39.5069896Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:48:39.5069997Z #define __SYSCALL_WORDSIZE 64 2025-05-07T19:48:39.5070125Z #define __TEXTURE_FETCH_FUNCTIONS_H__ 2025-05-07T19:48:39.5070240Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:48:39.5070340Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:48:39.5070450Z #define __THROW throw () 2025-05-07T19:48:39.5070546Z #define __THROWNL throw () 2025-05-07T19:48:39.5070649Z #define __TIMER_T_TYPE void * 2025-05-07T19:48:39.5070764Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:39.5070891Z #define __U16_TYPE unsigned short int 2025-05-07T19:48:39.5070992Z #define 
__U32_TYPE unsigned int 2025-05-07T19:48:39.5071098Z #define __U64_TYPE unsigned long int 2025-05-07T19:48:39.5071214Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:48:39.5071315Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:48:39.5071413Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:48:39.5071628Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:48:39.5071737Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:48:39.5071834Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:48:39.5071974Z #define __UINT16_MAX__ 65535 2025-05-07T19:48:39.5072100Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:48:39.5072200Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:48:39.5072342Z #define __UINT32_FMTX__ "X" 2025-05-07T19:48:39.5072444Z #define __UINT32_FMTo__ "o" 2025-05-07T19:48:39.5072558Z #define __UINT32_FMTu__ "u" 2025-05-07T19:48:39.5072653Z #define __UINT32_FMTx__ "x" 2025-05-07T19:48:39.5072755Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:48:39.5072878Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:48:39.5072978Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:48:39.5073076Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:48:39.5073173Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:48:39.5073287Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:48:39.5073382Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:48:39.5073541Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:48:39.5073672Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:48:39.5073767Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:48:39.5073863Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:48:39.5073961Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:48:39.5074073Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:48:39.5074171Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:48:39.5074265Z #define __UINT8_MAX__ 255 2025-05-07T19:48:39.5074384Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:48:39.5074485Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:48:39.5074586Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:48:39.5074687Z #define 
__UINTMAX_FMTo__ "lo" 2025-05-07T19:48:39.5074803Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:48:39.5074901Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:48:39.5075019Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:48:39.5075151Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:48:39.5075252Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:48:39.5075352Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:48:39.5075449Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:48:39.5075558Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:48:39.5075655Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:48:39.5075771Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:48:39.5075899Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:48:39.5075997Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:48:39.5076096Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:48:39.5076207Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:48:39.5076305Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:48:39.5076402Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:48:39.5092374Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:48:39.5092573Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:48:39.5092687Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:48:39.5092786Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:48:39.5092902Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:48:39.5093021Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:48:39.5093129Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:48:39.5093255Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:48:39.5093361Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:48:39.5093459Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:48:39.5093569Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:48:39.5093668Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:48:39.5093799Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:48:39.5093925Z #define __UINT_FAST64_TYPE__ long unsigned int 
2025-05-07T19:48:39.5094040Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:48:39.5094147Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:48:39.5094241Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:48:39.5094352Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:48:39.5094454Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:48:39.5094566Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:48:39.5094937Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:48:39.5095057Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:48:39.5095157Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:48:39.5095256Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:48:39.5095370Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:48:39.5095490Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:48:39.5095590Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:48:39.5095690Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:48:39.5095799Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:48:39.5095899Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:48:39.5096003Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:48:39.5096132Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:48:39.5096233Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:48:39.5096336Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:48:39.5096436Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:48:39.5096545Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:48:39.5096683Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:48:39.5096810Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:48:39.5096921Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:48:39.5097021Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:48:39.5097121Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:48:39.5097231Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:48:39.5097327Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:48:39.5097442Z #define __UINT_LEAST8_TYPE__ unsigned char 
2025-05-07T19:48:39.5097542Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:48:39.5097670Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:48:39.5097779Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:48:39.5097882Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:48:39.5097996Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:48:39.5098083Z #define __USE_ANSI 1 2025-05-07T19:48:39.5098171Z #define __USE_ATFILE 1 2025-05-07T19:48:39.5098260Z #define __USE_BSD 1 2025-05-07T19:48:39.5098370Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:48:39.5098454Z #define __USE_GNU 1 2025-05-07T19:48:39.5098544Z #define __USE_ISOC11 1 2025-05-07T19:48:39.5098643Z #define __USE_ISOC95 1 2025-05-07T19:48:39.5098730Z #define __USE_ISOC99 1 2025-05-07T19:48:39.5098822Z #define __USE_ISOCXX11 1 2025-05-07T19:48:39.5098912Z #define __USE_LARGEFILE 1 2025-05-07T19:48:39.5099017Z #define __USE_LARGEFILE64 1 2025-05-07T19:48:39.5099106Z #define __USE_MISC 1 2025-05-07T19:48:39.5099193Z #define __USE_POSIX 1 2025-05-07T19:48:39.5099301Z #define __USE_POSIX199309 1 2025-05-07T19:48:39.5099393Z #define __USE_POSIX199506 1 2025-05-07T19:48:39.5099481Z #define __USE_POSIX2 1 2025-05-07T19:48:39.5099568Z #define __USE_SVID 1 2025-05-07T19:48:39.5099664Z #define __USE_UNIX98 1 2025-05-07T19:48:39.5099753Z #define __USE_XOPEN 1 2025-05-07T19:48:39.5099843Z #define __USE_XOPEN2K 1 2025-05-07T19:48:39.5099931Z #define __USE_XOPEN2K8 1 2025-05-07T19:48:39.5100147Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:48:39.5100247Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:48:39.5100345Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:48:39.5100459Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:48:39.5100561Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:48:39.5100668Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:48:39.5100767Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:48:39.5100876Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:48:39.5100971Z #define __VECTOR_TYPES_H__ 
2025-05-07T19:48:39.5101420Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:48:39.5101552Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:48:39.5101654Z #define __WAIT_STATUS void * 2025-05-07T19:48:39.5101752Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:48:39.5101855Z #define __WALL 0x40000000 2025-05-07T19:48:39.5101948Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:48:39.5102041Z #define __WCHAR_TYPE__ int 2025-05-07T19:48:39.5102240Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:48:39.5102342Z #define __WCLONE 0x80000000 2025-05-07T19:48:39.5102481Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:48:39.5102573Z #define __WCOREFLAG 0x80 2025-05-07T19:48:39.5102738Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:48:39.5102898Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:48:39.5103040Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:48:39.5103271Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:48:39.5103435Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:48:39.5103537Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:48:39.5103639Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:48:39.5103741Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:48:39.5103935Z #define __WINT_WIDTH__ 32 2025-05-07T19:48:39.5104023Z #define __WNOTHREAD 0x20000000 2025-05-07T19:48:39.5104108Z #define __WORDSIZE 64 2025-05-07T19:48:39.5104219Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:48:39.5104341Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:48:39.5104451Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:48:39.5104559Z #define __W_CONTINUED 0xffff 2025-05-07T19:48:39.5104680Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:48:39.5104790Z #define __W_STOPCODE(sig) ((sig) << 
8 | 0x7f) 2025-05-07T19:48:39.5104892Z #define ____FILE_defined 1 2025-05-07T19:48:39.5104984Z #define ____mbstate_t_defined 1 2025-05-07T19:48:39.5105099Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:48:39.5105282Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:48:39.5105376Z #define __amd64 1 2025-05-07T19:48:39.5105455Z #define __amd64__ 1 2025-05-07T19:48:39.5105559Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:48:39.5105671Z #define __attribute_artificial__ 2025-05-07T19:48:39.5105812Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:48:39.5105992Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:48:39.5106187Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:48:39.5106451Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:48:39.5106595Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:48:39.5106752Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:48:39.5106893Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:48:39.5107023Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:48:39.5107249Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:48:39.5107355Z #define __blkcnt_t_defined 2025-05-07T19:48:39.5107447Z #define __blksize_t_defined 2025-05-07T19:48:39.5107633Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:48:39.5107766Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:48:39.5107859Z #define __bounded 2025-05-07T19:48:39.5108459Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 
2025-05-07T19:48:39.5108935Z #define __bswap_32(x) (__extension__ ({ unsigned int __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_32 (__x); else __asm__ ("bswap %0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:48:39.5109409Z #define __bswap_64(x) (__extension__ ({ __uint64_t __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_64 (__x); else __asm__ ("bswap %q0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:48:39.5109663Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:48:39.5110070Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:48:39.5111054Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:48:39.5111159Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:48:39.5111262Z #define __catch(X) catch(X) 2025-05-07T19:48:39.5111343Z #define __cdecl 2025-05-07T19:48:39.5111423Z #define __clang__ 1 2025-05-07T19:48:39.5111533Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:48:39.5111636Z #define __clang_major__ 16 2025-05-07T19:48:39.5111723Z #define __clang_minor__ 0 2025-05-07T19:48:39.5111815Z #define __clang_patchlevel__ 6 2025-05-07T19:48:39.5112239Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:48:39.5112454Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:48:39.5112548Z #define __clock_t_defined 1 2025-05-07T19:48:39.5112640Z #define __clockid_t_defined 1 2025-05-07T19:48:39.5113021Z #define 
__cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:48:39.5113119Z #define __code_model_small__ 1 2025-05-07T19:48:39.5113234Z #define __constant__ __location__(constant) 2025-05-07T19:48:39.5113348Z #define __cplusplus 201703L 2025-05-07T19:48:39.5113454Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:48:39.5113560Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:48:39.5113683Z #define __cpp_alias_templates 200704L 2025-05-07T19:48:39.5113786Z #define __cpp_aligned_new 201606L 2025-05-07T19:48:39.5113884Z #define __cpp_attributes 200809L 2025-05-07T19:48:39.5113990Z #define __cpp_binary_literals 201304L 2025-05-07T19:48:39.5114120Z #define __cpp_capture_star_this 201603L 2025-05-07T19:48:39.5114223Z #define __cpp_constexpr 201603L 2025-05-07T19:48:39.5114342Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:48:39.5114455Z #define __cpp_decltype 200707L 2025-05-07T19:48:39.5114559Z #define __cpp_decltype_auto 201304L 2025-05-07T19:48:39.5114663Z #define __cpp_deduction_guides 201703L 2025-05-07T19:48:39.5114790Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:48:39.5114912Z #define __cpp_digit_separators 201309L 2025-05-07T19:48:39.5115030Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:48:39.5115131Z #define __cpp_exceptions 199711L 2025-05-07T19:48:39.5115246Z #define __cpp_fold_expressions 201603L 2025-05-07T19:48:39.5115351Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:48:39.5115476Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:48:39.5115577Z #define __cpp_hex_float 201603L 2025-05-07T19:48:39.5115691Z #define __cpp_if_constexpr 201606L 2025-05-07T19:48:39.5115813Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:48:39.5115943Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:48:39.5116058Z #define __cpp_init_captures 201304L 2025-05-07T19:48:39.5116167Z #define __cpp_initializer_lists 200806L 2025-05-07T19:48:39.5116275Z #define __cpp_inline_variables 
201606L 2025-05-07T19:48:39.5116373Z #define __cpp_lambdas 200907L 2025-05-07T19:48:39.5116507Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:48:39.5116621Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:48:39.5116719Z #define __cpp_lib_as_const 201510 2025-05-07T19:48:39.5116838Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:48:39.5116953Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:48:39.5117122Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:48:39.5117232Z #define __cpp_lib_hypot 201603 2025-05-07T19:48:39.5117342Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:48:39.5117479Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:48:39.5117707Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:48:39.5117822Z #define __cpp_lib_is_final 201402L 2025-05-07T19:48:39.5117925Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:48:39.5118036Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:48:39.5118158Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:48:39.5118260Z #define __cpp_lib_launder 201606 2025-05-07T19:48:39.5118367Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:48:39.5118494Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:48:39.5118636Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:48:39.5118747Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:48:39.5118892Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:48:39.5119051Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:48:39.5119163Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:48:39.5119273Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:48:39.5119441Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:48:39.5119554Z #define __cpp_lib_void_t 201411 2025-05-07T19:48:39.5119679Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:48:39.5119798Z #define __cpp_namespace_attributes 
201411L 2025-05-07T19:48:39.5119948Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:48:39.5120069Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:48:39.5120190Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:48:39.5120349Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:48:39.5120443Z #define __cpp_nsdmi 200809L 2025-05-07T19:48:39.5120547Z #define __cpp_range_based_for 201603L 2025-05-07T19:48:39.5120651Z #define __cpp_raw_strings 200710L 2025-05-07T19:48:39.5120769Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:48:39.5120887Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:48:39.5120980Z #define __cpp_rtti 199711L 2025-05-07T19:48:39.5121103Z #define __cpp_rvalue_references 200610L 2025-05-07T19:48:39.5121212Z #define __cpp_static_assert 201411L 2025-05-07T19:48:39.5121325Z #define __cpp_static_call_operator 202207L 2025-05-07T19:48:39.5121438Z #define __cpp_structured_bindings 201606L 2025-05-07T19:48:39.5121557Z #define __cpp_template_auto 201606L 2025-05-07T19:48:39.5121675Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:48:39.5121786Z #define __cpp_unicode_characters 200704L 2025-05-07T19:48:39.5121906Z #define __cpp_unicode_literals 200710L 2025-05-07T19:48:39.5122020Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:48:39.5122133Z #define __cpp_variable_templates 201304L 2025-05-07T19:48:39.5122239Z #define __cpp_variadic_templates 200704L 2025-05-07T19:48:39.5122359Z #define __cpp_variadic_using 201611L 2025-05-07T19:48:39.5122463Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:48:39.5122565Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:48:39.5122677Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:48:39.5122784Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:48:39.5122894Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:48:39.5123058Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:48:39.5123153Z #define 
__daddr_t_defined 2025-05-07T19:48:39.5123243Z #define __dev_t_defined 2025-05-07T19:48:39.5123347Z #define __device__ __location__(device) 2025-05-07T19:48:39.5123506Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:48:39.5123754Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:48:39.5123994Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:48:39.5124149Z #define __errordecl(name,msg) extern void name (void) 2025-05-07T19:48:39.5124239Z #define __export__ 2025-05-07T19:48:39.5124501Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:48:39.5124714Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:48:39.5124979Z #define __flexarr [] 2025-05-07T19:48:39.5125202Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:48:39.5125403Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:48:39.5125510Z #define __fsblkcnt_t_defined 2025-05-07T19:48:39.5125601Z #define __fsfilcnt_t_defined 2025-05-07T19:48:39.5125685Z #define __gid_t_defined 2025-05-07T19:48:39.5125844Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:48:39.5125991Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:48:39.5126218Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:48:39.5126333Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:48:39.5126444Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:48:39.5126565Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:48:39.5126689Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:48:39.5127052Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:48:39.5127249Z #define 
__glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:48:39.5127411Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:48:39.5127528Z #define __glibcxx_function_requires(...) 2025-05-07T19:48:39.5127633Z #define __glibcxx_integral_traps true 2025-05-07T19:48:39.5127930Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:48:39.5128182Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:48:39.5128376Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:48:39.5128516Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:48:39.5128722Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:48:39.5128845Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:48:39.5128960Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:48:39.5129111Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:48:39.5129259Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:48:39.5129394Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:48:39.5129564Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:48:39.5129756Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:48:39.5129903Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:48:39.5130007Z #define __glibcxx_requires_nonempty() 2025-05-07T19:48:39.5130200Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:48:39.5130420Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:48:39.5130600Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:48:39.5130825Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 
2025-05-07T19:48:39.5130957Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:48:39.5131113Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:48:39.5131273Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:48:39.5131487Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:48:39.5131598Z #define __glibcxx_requires_string(_String) 2025-05-07T19:48:39.5131729Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:48:39.5131844Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:48:39.5131976Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:48:39.5132086Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:48:39.5132183Z #define __global__ __location__(global) 2025-05-07T19:48:39.5132276Z #define __gnu_linux__ 1 2025-05-07T19:48:39.5132407Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:48:39.5132615Z #define __have_pthread_attr_t 1 2025-05-07T19:48:39.5132721Z #define __host__ __location__(host) 2025-05-07T19:48:39.5132803Z #define __id_t_defined 2025-05-07T19:48:39.5132883Z #define __import__ 2025-05-07T19:48:39.5132970Z #define __ino64_t_defined 2025-05-07T19:48:39.5133064Z #define __ino_t_defined 2025-05-07T19:48:39.5133149Z #define __int8_t_defined 2025-05-07T19:48:39.5133368Z #define __intN_t(N,MODE) typedef int int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:48:39.5133564Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:48:39.5133644Z #define __k8 1 2025-05-07T19:48:39.5133721Z #define __k8__ 1 2025-05-07T19:48:39.5133807Z #define __key_t_defined 2025-05-07T19:48:39.5134002Z #define __launch_bounds__(...) 
__annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:48:39.5134090Z #define __ldiv_t_defined 1 2025-05-07T19:48:39.5134168Z #define __linux 1 2025-05-07T19:48:39.5134263Z #define __linux__ 1 2025-05-07T19:48:39.5134352Z #define __lldiv_t_defined 1 2025-05-07T19:48:39.5134431Z #define __llvm__ 1 2025-05-07T19:48:39.5134541Z #define __location__(a) __annotate__(a) 2025-05-07T19:48:39.5134637Z #define __long_double_t long double 2025-05-07T19:48:39.5134732Z #define __malloc_and_calloc_defined 2025-05-07T19:48:39.5134835Z #define __managed__ __location__(managed) 2025-05-07T19:48:39.5134932Z #define __mode_t_defined 2025-05-07T19:48:39.5135012Z #define __need_IOV_MAX 2025-05-07T19:48:39.5135094Z #define __need_clock_t 2025-05-07T19:48:39.5135188Z #define __need_clockid_t 2025-05-07T19:48:39.5135270Z #define __need_time_t 2025-05-07T19:48:39.5135350Z #define __need_timer_t 2025-05-07T19:48:39.5135437Z #define __need_timespec 2025-05-07T19:48:39.5135535Z #define __nlink_t_defined 2025-05-07T19:48:39.5135655Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:48:39.5135768Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:48:39.5135948Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:48:39.5136035Z #define __off64_t_defined 2025-05-07T19:48:39.5136119Z #define __off_t_defined 2025-05-07T19:48:39.5136196Z #define __pic__ 2 2025-05-07T19:48:39.5136297Z #define __pid_t_defined 2025-05-07T19:48:39.5136373Z #define __pie__ 2 2025-05-07T19:48:39.5136469Z #define __private_extern__ extern 2025-05-07T19:48:39.5136567Z #define __ptr_t void * 2025-05-07T19:48:39.5136646Z #define __ptrvalue 2025-05-07T19:48:39.5136729Z #define __restrict_arr 2025-05-07T19:48:39.5136859Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:48:39.5136999Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:48:39.5137094Z #define __shared__ __location__(shared) 2025-05-07T19:48:39.5137181Z #define __sigset_t_defined 
2025-05-07T19:48:39.5137290Z #define __specialization_static 2025-05-07T19:48:39.5137376Z #define __ssize_t_defined 2025-05-07T19:48:39.5137457Z #define __stub_bdflush 2025-05-07T19:48:39.5137537Z #define __stub_chflags 2025-05-07T19:48:39.5137640Z #define __stub_fattach 2025-05-07T19:48:39.5137722Z #define __stub_fchflags 2025-05-07T19:48:39.5137800Z #define __stub_fdetach 2025-05-07T19:48:39.5137889Z #define __stub_getmsg 2025-05-07T19:48:39.5137968Z #define __stub_gtty 2025-05-07T19:48:39.5138046Z #define __stub_lchmod 2025-05-07T19:48:39.5138128Z #define __stub_putmsg 2025-05-07T19:48:39.5138220Z #define __stub_revoke 2025-05-07T19:48:39.5138304Z #define __stub_setlogin 2025-05-07T19:48:39.5138387Z #define __stub_sigreturn 2025-05-07T19:48:39.5138480Z #define __stub_sstk 2025-05-07T19:48:39.5138559Z #define __stub_stty 2025-05-07T19:48:39.5138652Z #define __suseconds_t_defined 2025-05-07T19:48:39.5138733Z #define __thread__ __thread 2025-05-07T19:48:39.5138845Z #define __throw_exception_again throw 2025-05-07T19:48:39.5138929Z #define __time_t_defined 1 2025-05-07T19:48:39.5139015Z #define __timer_t_defined 1 2025-05-07T19:48:39.5139103Z #define __timespec_defined 1 2025-05-07T19:48:39.5139243Z #define __try try 2025-05-07T19:48:39.5139373Z #define __tune_k8__ 1 2025-05-07T19:48:39.5139464Z #define __u_char_defined 2025-05-07T19:48:39.5139736Z #define __u_intN_t(N,MODE) typedef unsigned int u_int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:48:39.5139822Z #define __uid_t_defined 2025-05-07T19:48:39.5139902Z #define __unbounded 2025-05-07T19:48:39.5139995Z #define __unix 1 2025-05-07T19:48:39.5140073Z #define __unix__ 1 2025-05-07T19:48:39.5140165Z #define __useconds_t_defined 2025-05-07T19:48:39.5140245Z #define __warnattr(msg) 2025-05-07T19:48:39.5140390Z #define __warndecl(name,msg) extern void name (void) 2025-05-07T19:48:39.5140469Z #define __wur 2025-05-07T19:48:39.5140544Z #define __x86_64 1 2025-05-07T19:48:39.5140622Z #define __x86_64__ 1 
2025-05-07T19:48:39.5140745Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:48:39.5141084Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:48:39.5141480Z #define assert_perror(errnum) (!(errnum) ? __ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:48:39.5141587Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:48:39.5141679Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:48:39.5141768Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:48:39.5141880Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:48:39.5141975Z #define cudaArrayCubemap 0x04 2025-05-07T19:48:39.5142069Z #define cudaArrayDefault 0x00 2025-05-07T19:48:39.5142177Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:48:39.5142284Z #define cudaArrayLayered 0x01 2025-05-07T19:48:39.5142378Z #define cudaArraySparse 0x40 2025-05-07T19:48:39.5142525Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:48:39.5142641Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:48:39.5142743Z #define cudaArrayTextureGather 0x08 2025-05-07T19:48:39.5142915Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:48:39.5143097Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:48:39.5143194Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:48:39.5143297Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:48:39.5143405Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:48:39.5143509Z #define cudaDeviceMapHost 0x08 2025-05-07T19:48:39.5143601Z #define cudaDeviceMask 0x1f 2025-05-07T19:48:39.5144090Z #define cudaDevicePropDontCare { {'\0'}, {{0}}, {'\0'}, 0, 0, 0, 0, 0, 0, 0, {0, 0, 0}, {0, 0, 0}, 0, 0, -1, -1, 0, 0, -1, 0, 0, 0, 0, 0, 0, 0, 0, {0, 0}, {0, 0}, {0, 0, 0}, {0, 0}, {0, 0, 0}, {0, 0, 0}, 0, {0, 0}, {0, 0, 0}, {0, 0}, 0, {0, 0}, {0, 0, 0}, {0, 0}, {0, 0, 0}, 0, {0, 0}, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, } 2025-05-07T19:48:39.5144200Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:48:39.5144322Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:48:39.5144424Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:48:39.5144541Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:48:39.5144640Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:48:39.5144742Z #define cudaEventBlockingSync 0x01 2025-05-07T19:48:39.5144837Z #define cudaEventDefault 0x00 2025-05-07T19:48:39.5144949Z #define cudaEventDisableTiming 0x02 2025-05-07T19:48:39.5145046Z #define cudaEventInterprocess 0x04 2025-05-07T19:48:39.5145145Z #define cudaEventRecordDefault 0x00 2025-05-07T19:48:39.5145256Z #define cudaEventRecordExternal 0x01 2025-05-07T19:48:39.5145354Z #define cudaEventWaitDefault 0x00 2025-05-07T19:48:39.5145451Z #define cudaEventWaitExternal 0x01 2025-05-07T19:48:39.5145560Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:48:39.5145758Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:48:39.5145935Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:48:39.5146036Z #define cudaHostAllocDefault 0x00 2025-05-07T19:48:39.5146149Z #define cudaHostAllocMapped 0x02 2025-05-07T19:48:39.5146299Z #define cudaHostAllocPortable 0x01 2025-05-07T19:48:39.5146455Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:48:39.5146575Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:48:39.5146678Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:48:39.5146781Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:48:39.5146887Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:48:39.5147004Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:48:39.5147107Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:48:39.5147226Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:48:39.5147376Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 
2025-05-07T19:48:39.5147537Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:48:39.5147845Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:48:39.5148138Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:48:39.5148634Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:48:39.5148880Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:48:39.5149093Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:48:39.5149207Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:48:39.5149301Z #define cudaMemAttachHost 0x02 2025-05-07T19:48:39.5149399Z #define cudaMemAttachSingle 0x04 2025-05-07T19:48:39.5149515Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:48:39.5149617Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:48:39.5149715Z #define cudaOccupancyDefault 0x00 2025-05-07T19:48:39.5149851Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:48:39.5149969Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:48:39.5150307Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:48:39.5150431Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:48:39.5150598Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:48:39.5150890Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:48:39.5151212Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:48:39.5151316Z #define cudaStreamDefault 0x00 2025-05-07T19:48:39.5151434Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:48:39.5151537Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:48:39.5151663Z #define cudaStreamPerThread ((cudaStream_t)0x2) 
2025-05-07T19:48:39.5151768Z #define cudaSurfaceType1D 0x01 2025-05-07T19:48:39.5151873Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:48:39.5151968Z #define cudaSurfaceType2D 0x02 2025-05-07T19:48:39.5152082Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:48:39.5152181Z #define cudaSurfaceType3D 0x03 2025-05-07T19:48:39.5152282Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:48:39.5152473Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:48:39.5152590Z #define cudaTextureType1D 0x01 2025-05-07T19:48:39.5152694Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:48:39.5152961Z #define cudaTextureType2D 0x02 2025-05-07T19:48:39.5153083Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:48:39.5153184Z #define cudaTextureType3D 0x03 2025-05-07T19:48:39.5153294Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:48:39.5153451Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:48:39.5153809Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:48:39.5153910Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:48:39.5154011Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:48:39.5154119Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:48:39.5154216Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:48:39.5154305Z #define htole16(x) (x) 2025-05-07T19:48:39.5154403Z #define htole32(x) (x) 2025-05-07T19:48:39.5154489Z #define htole64(x) (x) 2025-05-07T19:48:39.5154638Z #define le16toh(x) (x) 2025-05-07T19:48:39.5154774Z #define le32toh(x) (x) 2025-05-07T19:48:39.5154871Z #define le64toh(x) (x) 2025-05-07T19:48:39.5154954Z #define linux 1 2025-05-07T19:48:39.5155059Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:48:39.5155207Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:48:39.5155358Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:48:39.5155461Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:48:39.5155588Z #define offsetof(t,d) 
__builtin_offsetof(t, d) 2025-05-07T19:48:39.5155712Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:48:39.5155802Z #define stderr stderr 2025-05-07T19:48:39.5155888Z #define stdin stdin 2025-05-07T19:48:39.5155994Z #define stdout stdout 2025-05-07T19:48:39.5156516Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:48:39.5157105Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:48:39.5157207Z #define unix 1 2025-05-07T19:48:39.5157340Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:48:39.5157466Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:48:39.5157585Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:48:39.5157718Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:48:39.5157841Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:48:39.5157848Z 2025-05-07T19:48:39.5495501Z 2025-05-07T19:48:39.5495918Z + conda run -n build_binary nvcc --version 2025-05-07T19:48:39.5495941Z 2025-05-07T19:48:41.3571263Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:48:41.3571663Z Copyright (c) 2005-2022 NVIDIA Corporation 2025-05-07T19:48:41.3572042Z Built on Wed_Sep_21_10:33:58_PDT_2022 2025-05-07T19:48:41.3572414Z Cuda compilation tools, release 11.8, V11.8.89 2025-05-07T19:48:41.3572773Z Build cuda_11.8.r11.8/compiler.31833905_0 2025-05-07T19:48:41.3572987Z 2025-05-07T19:48:41.4155283Z 2025-05-07T19:48:41.4163875Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:48:41.4166034Z [CHECK] nvidia-smi not found 2025-05-07T19:48:41.4166939Z [INSTALL] Successfully installed 
CUDA 11.8.0 2025-05-07T19:48:41.4264903Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/11.8.0 2025-05-07T19:48:41.4265531Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/11.8.0 2025-05-07T19:48:41.4266163Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:41.4266529Z env: 2025-05-07T19:48:41.4266789Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:41.4267101Z BUILD_ENV: build_binary 2025-05-07T19:48:41.4267385Z BUILD_TARGET: default 2025-05-07T19:48:41.4267652Z BUILD_VARIANT: cuda 2025-05-07T19:48:41.4267922Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:48:41.4268179Z ##[endgroup] 2025-05-07T19:48:41.8489034Z ################################################################################ 2025-05-07T19:48:41.8490078Z # Install PyTorch (PIP) 2025-05-07T19:48:41.8490734Z # 2025-05-07T19:48:41.8506202Z # [2025-05-07T19:48:41.849Z] + install_pytorch_pip build_binary nightly cuda/11.8.0 2025-05-07T19:48:41.8506920Z ################################################################################ 2025-05-07T19:48:41.8507184Z 2025-05-07T19:48:41.8550021Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:48:42.7383556Z Channels: 2025-05-07T19:48:42.7384364Z - conda-forge 2025-05-07T19:48:42.7384718Z Platform: linux-64 2025-05-07T19:48:52.3288679Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:48:53.8761458Z Solving environment: | / - done 2025-05-07T19:48:54.0551750Z 2025-05-07T19:48:54.0552094Z ## Package Plan ## 2025-05-07T19:48:54.0552284Z 2025-05-07T19:48:54.0552723Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:48:54.0553294Z 2025-05-07T19:48:54.0553412Z added / updated specs: 2025-05-07T19:48:54.0553791Z - numpy 2025-05-07T19:48:54.0553925Z 2025-05-07T19:48:54.0553930Z 2025-05-07T19:48:54.0554070Z The following packages will be downloaded: 2025-05-07T19:48:54.0554336Z 
2025-05-07T19:48:54.0554494Z package | build 2025-05-07T19:48:54.0554846Z ---------------------------|----------------- 2025-05-07T19:48:54.0555292Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:48:54.0555796Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:48:54.0556334Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:48:54.0556961Z numpy-2.2.5 | py311h5d046bc_0 8.6 MB conda-forge 2025-05-07T19:48:54.0557362Z ------------------------------------------------------------ 2025-05-07T19:48:54.0557743Z Total: 8.7 MB 2025-05-07T19:48:54.0557957Z 2025-05-07T19:48:54.0558089Z The following NEW packages will be INSTALLED: 2025-05-07T19:48:54.0558341Z 2025-05-07T19:48:54.0558581Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:48:54.0559131Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:48:54.0559667Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:48:54.0560196Z numpy conda-forge/linux-64::numpy-2.2.5-py311h5d046bc_0 2025-05-07T19:48:54.0560476Z 2025-05-07T19:48:54.0560480Z 2025-05-07T19:48:54.0560483Z 2025-05-07T19:48:54.0560632Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:48:54.1261812Z libblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:48:54.1340055Z liblapack-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:48:54.1361328Z libcblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:48:54.5764491Z numpy-2.2.5 | 8.6 MB | ########## | 100% 2025-05-07T19:48:54.5770728Z done 2025-05-07T19:48:54.6780041Z Preparing transaction: | done 2025-05-07T19:48:54.7789479Z Verifying
transaction: - done 2025-05-07T19:48:54.8797326Z Executing transaction: | done 2025-05-07T19:48:54.9802509Z ################################################################################ 2025-05-07T19:48:54.9802951Z # Install Package From PyTorch PIP: torch 2025-05-07T19:48:54.9803327Z # 2025-05-07T19:48:54.9827704Z # [2025-05-07T19:48:54.981Z] + install_from_pytorch_pip build_binary torch nightly cuda/11.8.0 2025-05-07T19:48:54.9828318Z ################################################################################ 2025-05-07T19:48:54.9828583Z 2025-05-07T19:48:54.9846791Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:55.0709987Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:55.0710536Z ################################################################################ 2025-05-07T19:48:55.0710955Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:48:55.0711297Z # 2025-05-07T19:48:55.0729010Z # [2025-05-07T19:48:55.072Z] + __prepare_pip_arguments torch nightly cuda/11.8.0 2025-05-07T19:48:55.0730494Z ################################################################################ 2025-05-07T19:48:55.0731205Z 2025-05-07T19:48:55.0751714Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:48:55.0787033Z [INSTALL] Extracted package variant: cu118 2025-05-07T19:48:55.0802525Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:48:55.0803403Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:48:55.0810071Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:48:55.0821169Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu118/ ... 
2025-05-07T19:48:55.0842226Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:50:13.7815497Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:50:13.7816979Z 2025-05-07T19:50:13.7817198Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:50:13.7817599Z Collecting torch 2025-05-07T19:50:13.7818274Z Downloading https://download.pytorch.org/whl/nightly/cu118/torch-2.8.0.dev20250507%2Bcu118-cp311-cp311-manylinux_2_28_x86_64.whl.metadata (29 kB) 2025-05-07T19:50:13.7819048Z Collecting filelock (from torch) 2025-05-07T19:50:13.7819545Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:50:13.7820512Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from torch) (4.13.2) 2025-05-07T19:50:13.7821255Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:50:13.7821766Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:50:13.7822684Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 208.8 MB/s eta 0:00:00 2025-05-07T19:50:13.7823031Z Collecting networkx (from torch) 2025-05-07T19:50:13.7823545Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:50:13.7824225Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 146.8 MB/s eta 0:00:00 2025-05-07T19:50:13.7825136Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from 
torch) (3.1.6) 2025-05-07T19:50:13.7825818Z Collecting fsspec (from torch) 2025-05-07T19:50:13.7826317Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:50:13.7826918Z Collecting nvidia-cuda-nvrtc-cu11==11.8.89 (from torch) 2025-05-07T19:50:13.7827638Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_nvrtc_cu11-11.8.89-py3-none-manylinux1_x86_64.whl (23.2 MB) 2025-05-07T19:50:13.7828457Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.2/23.2 MB 160.4 MB/s eta 0:00:00 2025-05-07T19:50:13.7828886Z Collecting nvidia-cuda-runtime-cu11==11.8.89 (from torch) 2025-05-07T19:50:13.7829617Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_runtime_cu11-11.8.89-py3-none-manylinux1_x86_64.whl (875 kB) 2025-05-07T19:50:13.7830440Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 875.6/875.6 kB 89.8 MB/s eta 0:00:00 2025-05-07T19:50:13.7830836Z Collecting nvidia-cuda-cupti-cu11==11.8.87 (from torch) 2025-05-07T19:50:13.7831618Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_cupti_cu11-11.8.87-py3-none-manylinux1_x86_64.whl (13.1 MB) 2025-05-07T19:50:13.7832557Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 13.1/13.1 MB 188.0 MB/s eta 0:00:00 2025-05-07T19:50:13.7833147Z Collecting nvidia-cudnn-cu11==9.1.0.70 (from torch) 2025-05-07T19:50:13.7833916Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cudnn_cu11-9.1.0.70-py3-none-manylinux2014_x86_64.whl (663.9 MB) 2025-05-07T19:50:13.7834766Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 663.9/663.9 MB 50.4 MB/s eta 0:00:00 2025-05-07T19:50:13.7835206Z Collecting nvidia-cublas-cu11==11.11.3.6 (from torch) 2025-05-07T19:50:13.7835979Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cublas_cu11-11.11.3.6-py3-none-manylinux1_x86_64.whl (417.9 MB) 2025-05-07T19:50:13.7836838Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 417.9/417.9 MB 79.9 MB/s eta 0:00:00 2025-05-07T19:50:13.7837277Z Collecting 
nvidia-cufft-cu11==10.9.0.58 (from torch) 2025-05-07T19:50:13.7838016Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cufft_cu11-10.9.0.58-py3-none-manylinux1_x86_64.whl (168.4 MB) 2025-05-07T19:50:13.7838877Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 168.4/168.4 MB 210.7 MB/s eta 0:00:00 2025-05-07T19:50:13.7839431Z Collecting nvidia-curand-cu11==10.3.0.86 (from torch) 2025-05-07T19:50:13.7840172Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_curand_cu11-10.3.0.86-py3-none-manylinux1_x86_64.whl (58.1 MB) 2025-05-07T19:50:13.7841027Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 58.1/58.1 MB 201.5 MB/s eta 0:00:00 2025-05-07T19:50:13.7841446Z Collecting nvidia-cusolver-cu11==11.4.1.48 (from torch) 2025-05-07T19:50:13.7842225Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cusolver_cu11-11.4.1.48-py3-none-manylinux1_x86_64.whl (128.2 MB) 2025-05-07T19:50:13.7843100Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 128.2/128.2 MB 213.6 MB/s eta 0:00:00 2025-05-07T19:50:13.7843521Z Collecting nvidia-cusparse-cu11==11.7.5.86 (from torch) 2025-05-07T19:50:13.7844302Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cusparse_cu11-11.7.5.86-py3-none-manylinux1_x86_64.whl (204.1 MB) 2025-05-07T19:50:13.7845267Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 204.1/204.1 MB 165.1 MB/s eta 0:00:00 2025-05-07T19:50:13.7845650Z Collecting nvidia-nccl-cu11==2.21.5 (from torch) 2025-05-07T19:50:13.7846309Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_nccl_cu11-2.21.5-py3-none-manylinux2014_x86_64.whl (147.8 MB) 2025-05-07T19:50:13.7847088Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 147.8/147.8 MB 191.1 MB/s eta 0:00:00 2025-05-07T19:50:13.7847473Z Collecting nvidia-nvtx-cu11==11.8.86 (from torch) 2025-05-07T19:50:13.7848126Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_nvtx_cu11-11.8.86-py3-none-manylinux1_x86_64.whl (99 kB) 2025-05-07T19:50:13.7848906Z Collecting pytorch-triton==3.3.0+git96316ce5 
(from torch) 2025-05-07T19:50:13.7849759Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:50:13.7851097Z Requirement already satisfied: setuptools>=40.8.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from pytorch-triton==3.3.0+git96316ce5->torch) (78.1.1) 2025-05-07T19:50:13.7851998Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:50:13.7852555Z Downloading https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:50:13.7853218Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 47.7 MB/s eta 0:00:00 2025-05-07T19:50:13.7853980Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:50:13.7855109Z Downloading https://download.pytorch.org/whl/nightly/cu118/torch-2.8.0.dev20250507%2Bcu118-cp311-cp311-manylinux_2_28_x86_64.whl (916.4 MB) 2025-05-07T19:50:13.7855961Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 916.4/916.4 MB 26.9 MB/s eta 0:00:00 2025-05-07T19:50:13.7856836Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.5 MB) 2025-05-07T19:50:13.7857747Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.5/153.5 MB 54.7 MB/s eta 0:00:00 2025-05-07T19:50:13.7859223Z Installing collected packages: mpmath, sympy, pytorch-triton, nvidia-nvtx-cu11, nvidia-nccl-cu11, nvidia-cusparse-cu11, nvidia-curand-cu11, nvidia-cufft-cu11, nvidia-cuda-runtime-cu11, nvidia-cuda-nvrtc-cu11, nvidia-cuda-cupti-cu11, nvidia-cublas-cu11, networkx, fsspec, filelock, nvidia-cusolver-cu11, nvidia-cudnn-cu11, torch 2025-05-07T19:50:13.7860635Z 2025-05-07T19:50:13.7862304Z Successfully installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 
nvidia-cublas-cu11-11.11.3.6 nvidia-cuda-cupti-cu11-11.8.87 nvidia-cuda-nvrtc-cu11-11.8.89 nvidia-cuda-runtime-cu11-11.8.89 nvidia-cudnn-cu11-9.1.0.70 nvidia-cufft-cu11-10.9.0.58 nvidia-curand-cu11-10.3.0.86 nvidia-cusolver-cu11-11.4.1.48 nvidia-cusparse-cu11-11.7.5.86 nvidia-nccl-cu11-2.21.5 nvidia-nvtx-cu11-11.8.86 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu118 2025-05-07T19:50:13.7864067Z 2025-05-07T19:50:16.0595598Z torch 2.8.0.dev20250507+cu118 2025-05-07T19:50:16.0597657Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu118) 2025-05-07T19:50:19.2797648Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:50:22.6817241Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu118 2025-05-07T19:50:22.6817827Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:50:25.9263074Z True 2025-05-07T19:50:25.9264432Z True 2025-05-07T19:50:25.9264768Z 2025-05-07T19:50:26.0013627Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:50:26.0091205Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:50:26.0091914Z if . 
$PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:50:26.0092718Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:50:26.0093050Z env: 2025-05-07T19:50:26.0093273Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:50:26.0093590Z BUILD_ENV: build_binary 2025-05-07T19:50:26.0093855Z BUILD_TARGET: default 2025-05-07T19:50:26.0094091Z BUILD_VARIANT: cuda 2025-05-07T19:50:26.0094354Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:50:26.0094600Z ##[endgroup] 2025-05-07T19:50:26.4703255Z /github/home/miniconda/bin/conda 2025-05-07T19:50:26.4704146Z ################################################################################ 2025-05-07T19:50:26.4704745Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:50:26.4705224Z # 2025-05-07T19:50:26.4720412Z # [2025-05-07T19:50:26.471Z] + collect_pytorch_env_info build_binary 2025-05-07T19:50:26.4720857Z ################################################################################ 2025-05-07T19:50:26.4721132Z 2025-05-07T19:50:26.4742194Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:50:26.5601172Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:50:26.5618276Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:50:26.5619395Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:50:26.5619848Z 2025-05-07T19:50:26.6547170Z 2025-05-07T19:50:26.6548589Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:50:26.6568708Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:50:32.0174692Z Collecting environment information... 
2025-05-07T19:50:32.0175788Z PyTorch version: 2.8.0.dev20250507+cu118 2025-05-07T19:50:32.0176794Z Is debug build: False 2025-05-07T19:50:32.0177549Z CUDA used to build PyTorch: 11.8 2025-05-07T19:50:32.0178443Z ROCM used to build PyTorch: N/A 2025-05-07T19:50:32.0178997Z 2025-05-07T19:50:32.0179349Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:50:32.0179689Z GCC version: Could not collect 2025-05-07T19:50:32.0180351Z Clang version: 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:32.0180989Z CMake version: version 4.0.2 2025-05-07T19:50:32.0181311Z Libc version: glibc-2.34 2025-05-07T19:50:32.0181489Z 2025-05-07T19:50:32.0181829Z Python version: 3.11.11 | packaged by conda-forge | (main, Mar 3 2025, 20:43:55) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:50:32.0182646Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:50:32.0183126Z Is CUDA available: False 2025-05-07T19:50:32.0183427Z CUDA runtime version: 11.8.89 2025-05-07T19:50:32.0183777Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:50:32.0184149Z GPU models and configuration: Could not collect 2025-05-07T19:50:32.0184520Z Nvidia driver version: Could not collect 2025-05-07T19:50:32.0184886Z cuDNN version: Could not collect 2025-05-07T19:50:32.0185186Z HIP runtime version: N/A 2025-05-07T19:50:32.0185489Z MIOpen runtime version: N/A 2025-05-07T19:50:32.0186021Z Is XNNPACK available: True 2025-05-07T19:50:32.0186420Z 2025-05-07T19:50:32.0186517Z CPU: 2025-05-07T19:50:32.0186802Z Architecture: x86_64 2025-05-07T19:50:32.0187212Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:50:32.0187682Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:50:32.0188123Z Byte Order: Little Endian 2025-05-07T19:50:32.0188519Z CPU(s): 96 2025-05-07T19:50:32.0188863Z On-line CPU(s) list: 0-95 2025-05-07T19:50:32.0189648Z Vendor ID: GenuineIntel 2025-05-07T19:50:32.0190276Z Model name: Intel(R) Xeon(R) 
Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:50:32.0190729Z CPU family: 6 2025-05-07T19:50:32.0191042Z Model: 85 2025-05-07T19:50:32.0191382Z Thread(s) per core: 2 2025-05-07T19:50:32.0191747Z Core(s) per socket: 24 2025-05-07T19:50:32.0192076Z Socket(s): 2 2025-05-07T19:50:32.0192546Z Stepping: 7 2025-05-07T19:50:32.0192892Z BogoMIPS: 5999.98 2025-05-07T19:50:32.0195430Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:50:32.0198044Z Hypervisor vendor: KVM 2025-05-07T19:50:32.0198526Z Virtualization type: full 2025-05-07T19:50:32.0198883Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:50:32.0199288Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:50:32.0199690Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:50:32.0200063Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:50:32.0200428Z NUMA node(s): 2 2025-05-07T19:50:32.0200744Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:50:32.0201108Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:50:32.0201579Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:50:32.0202160Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:50:32.0202686Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:50:32.0203292Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:50:32.0203901Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:50:32.0204511Z 
Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:50:32.0205167Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:50:32.0205549Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:50:32.0205959Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:50:32.0206370Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:50:32.0206930Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:50:32.0207790Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:50:32.0208431Z Vulnerability Srbds: Not affected 2025-05-07T19:50:32.0208841Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:50:32.0209084Z 2025-05-07T19:50:32.0209197Z Versions of relevant libraries: 2025-05-07T19:50:32.0209674Z [pip3] numpy==2.2.5 2025-05-07T19:50:32.0209968Z [pip3] nvidia-cublas-cu11==11.11.3.6 2025-05-07T19:50:32.0210302Z [pip3] nvidia-cuda-cupti-cu11==11.8.87 2025-05-07T19:50:32.0210668Z [pip3] nvidia-cuda-nvrtc-cu11==11.8.89 2025-05-07T19:50:32.0211008Z [pip3] nvidia-cuda-runtime-cu11==11.8.89 2025-05-07T19:50:32.0211377Z [pip3] nvidia-cudnn-cu11==9.1.0.70 2025-05-07T19:50:32.0211693Z [pip3] nvidia-cufft-cu11==10.9.0.58 2025-05-07T19:50:32.0212134Z [pip3] nvidia-curand-cu11==10.3.0.86 2025-05-07T19:50:32.0212467Z [pip3] nvidia-cusolver-cu11==11.4.1.48 2025-05-07T19:50:32.0212922Z [pip3] nvidia-cusparse-cu11==11.7.5.86 2025-05-07T19:50:32.0213277Z [pip3] nvidia-nccl-cu11==2.21.5 2025-05-07T19:50:32.0213581Z [pip3] nvidia-nvtx-cu11==11.8.86 2025-05-07T19:50:32.0213921Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:50:32.0214258Z [pip3] torch==2.8.0.dev20250507+cu118 2025-05-07T19:50:32.0214716Z [conda] cuda-cudart 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0215391Z [conda] cuda-cudart-dev 11.8.89 0 nvidia/label/cuda-11.8.0 
2025-05-07T19:50:32.0215946Z [conda] cuda-cupti 11.8.87 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0216494Z [conda] cuda-libraries 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0217037Z [conda] cuda-libraries-dev 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0217603Z [conda] cuda-nvrtc 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0218114Z [conda] cuda-nvrtc-dev 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0218649Z [conda] cuda-nvtx 11.8.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0219149Z [conda] cuda-runtime 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0219683Z [conda] libcublas 11.11.3.6 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0220221Z [conda] libcublas-dev 11.11.3.6 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0220731Z [conda] libcufft 10.9.0.58 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0221266Z [conda] libcufft-dev 10.9.0.58 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0221773Z [conda] libcurand 10.3.0.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0222313Z [conda] libcurand-dev 10.3.0.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0222857Z [conda] libcusolver 11.4.1.48 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0223383Z [conda] libcusolver-dev 11.4.1.48 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0223934Z [conda] libcusparse 11.7.5.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0224464Z [conda] libcusparse-dev 11.7.5.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:32.0224987Z [conda] numpy 2.2.5 py311h5d046bc_0 conda-forge 2025-05-07T19:50:32.0225493Z [conda] nvidia-cublas-cu11 11.11.3.6 pypi_0 pypi 2025-05-07T19:50:32.0226013Z [conda] nvidia-cuda-cupti-cu11 11.8.87 pypi_0 pypi 2025-05-07T19:50:32.0226561Z [conda] nvidia-cuda-nvrtc-cu11 11.8.89 pypi_0 pypi 2025-05-07T19:50:32.0227079Z [conda] nvidia-cuda-runtime-cu11 11.8.89 pypi_0 pypi 2025-05-07T19:50:32.0227616Z [conda] nvidia-cudnn-cu11 9.1.0.70 pypi_0 pypi 2025-05-07T19:50:32.0228107Z [conda] nvidia-cufft-cu11 10.9.0.58 
pypi_0 pypi 2025-05-07T19:50:32.0228632Z [conda] nvidia-curand-cu11 10.3.0.86 pypi_0 pypi 2025-05-07T19:50:32.0229167Z [conda] nvidia-cusolver-cu11 11.4.1.48 pypi_0 pypi 2025-05-07T19:50:32.0229681Z [conda] nvidia-cusparse-cu11 11.7.5.86 pypi_0 pypi 2025-05-07T19:50:32.0230212Z [conda] nvidia-nccl-cu11 2.21.5 pypi_0 pypi 2025-05-07T19:50:32.0230698Z [conda] nvidia-nvtx-cu11 11.8.86 pypi_0 pypi 2025-05-07T19:50:32.0231220Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:50:32.0231847Z [conda] torch 2.8.0.dev20250507+cu118 pypi_0 pypi 2025-05-07T19:50:32.0232164Z 2025-05-07T19:50:32.1095412Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 11.8.0 2025-05-07T19:50:32.1096060Z . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 11.8.0 2025-05-07T19:50:32.1096675Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:50:32.1097009Z env: 2025-05-07T19:50:32.1097260Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:50:32.1097572Z BUILD_ENV: build_binary 2025-05-07T19:50:32.1097945Z BUILD_TARGET: default 2025-05-07T19:50:32.1098182Z BUILD_VARIANT: cuda 2025-05-07T19:50:32.1098432Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:50:32.1098696Z ##[endgroup] 2025-05-07T19:50:32.6062066Z ################################################################################ 2025-05-07T19:50:32.6063153Z # Install cuDNN 2025-05-07T19:50:32.6063838Z # 2025-05-07T19:50:32.6075764Z # [2025-05-07T19:50:32.607Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 11.8.0 2025-05-07T19:50:32.6077451Z ################################################################################ 2025-05-07T19:50:32.6078138Z 2025-05-07T19:50:32.6090527Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:50:32.6978829Z [CHECK] Network does not appear to be blocked. 
2025-05-07T19:50:32.6980292Z [INSTALL] cuda_concat_version is determined to be: 118 2025-05-07T19:50:32.6980706Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:50:32.6980971Z 2025-05-07T19:50:32.6992703Z 2025-05-07T19:50:32.6993251Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:50:32.6993949Z 2025-05-07T19:50:32.7009042Z 2025-05-07T19:50:32.7026416Z [INSTALL] Downloading cuDNN to /tmp/tmp.HTZzCVKABb ... 2025-05-07T19:50:32.7047858Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/redist/cudnn/v8.7.0/local_installers/11.8/cudnn-linux-x86_64-8.7.0.84_cuda11-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:50:34.3998204Z [INSTALL] Unpacking cuDNN ... 2025-05-07T19:50:34.3998597Z + tar -xvf cudnn.tar.xz 2025-05-07T19:50:34.3998771Z 2025-05-07T19:50:34.4026319Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/ 2025-05-07T19:50:34.4027027Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/ 2025-05-07T19:50:34.4027522Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer_static.a 2025-05-07T19:50:36.7885224Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer_static_v8.a 2025-05-07T19:50:36.7887065Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train_static.a 2025-05-07T19:50:39.0429951Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train_static_v8.a 2025-05-07T19:50:39.0431721Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer_static.a 2025-05-07T19:50:47.2671231Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer_static_v8.a 2025-05-07T19:50:47.2673339Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train_static.a 2025-05-07T19:50:48.8616274Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train_static_v8.a 2025-05-07T19:50:48.8617942Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer_static.a 2025-05-07T19:50:50.5494957Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer_static_v8.a 
2025-05-07T19:50:50.5496721Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train_static.a 2025-05-07T19:50:52.0598016Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train_static_v8.a 2025-05-07T19:50:52.0599673Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so.8 2025-05-07T19:50:52.0600402Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so 2025-05-07T19:50:52.0600903Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so.8.7.0 2025-05-07T19:50:52.0615200Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so.8 2025-05-07T19:50:52.0616207Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so 2025-05-07T19:50:52.0616745Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so.8.7.0 2025-05-07T19:50:54.4364635Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so.8 2025-05-07T19:50:54.4366317Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so 2025-05-07T19:50:54.4367890Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so.8.7.0 2025-05-07T19:50:56.6927455Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so 2025-05-07T19:50:56.6928045Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so.8 2025-05-07T19:50:56.6928595Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so.8.7.0 2025-05-07T19:51:05.2235693Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so 2025-05-07T19:51:05.2237397Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so.8.7.0 2025-05-07T19:51:06.8428507Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so.8 2025-05-07T19:51:06.8429991Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so.8.7.0 2025-05-07T19:51:08.5294606Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so 2025-05-07T19:51:08.5296303Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so.8 
2025-05-07T19:51:08.5297901Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so.8.7.0 2025-05-07T19:51:10.0427576Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so 2025-05-07T19:51:10.0428411Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so.8 2025-05-07T19:51:10.0428926Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/ 2025-05-07T19:51:10.0429382Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_v8.h 2025-05-07T19:51:10.0429922Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_infer_v8.h 2025-05-07T19:51:10.0430456Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_train_v8.h 2025-05-07T19:51:10.0431037Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_backend_v8.h 2025-05-07T19:51:10.0431603Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_infer_v8.h 2025-05-07T19:51:10.0432140Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_train_v8.h 2025-05-07T19:51:10.0432873Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_infer_v8.h 2025-05-07T19:51:10.0433616Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_train_v8.h 2025-05-07T19:51:10.0434196Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_version_v8.h 2025-05-07T19:51:10.0434721Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn.h 2025-05-07T19:51:10.0435219Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_infer.h 2025-05-07T19:51:10.0435770Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_train.h 2025-05-07T19:51:10.0436303Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_backend.h 2025-05-07T19:51:10.0436861Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_infer.h 2025-05-07T19:51:10.0437383Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_train.h 2025-05-07T19:51:10.0437946Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_infer.h 2025-05-07T19:51:10.0438501Z 
cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_train.h 2025-05-07T19:51:10.0439035Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_version.h 2025-05-07T19:51:10.0439652Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/LICENSE 2025-05-07T19:51:10.0451278Z 2025-05-07T19:51:10.0452542Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 2025-05-07T19:51:10.0454037Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:51:10.0454783Z 2025-05-07T19:51:10.0470697Z 2025-05-07T19:51:10.0471396Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:10.0472213Z 2025-05-07T19:51:10.0483684Z 2025-05-07T19:51:10.0484800Z + mv cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:51:10.0486942Z 2025-05-07T19:51:10.0515487Z 2025-05-07T19:51:10.0517361Z + mv cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:51:10.0518553Z 2025-05-07T19:51:11.5111732Z 2025-05-07T19:51:11.5112310Z /__w/FBGEMM/FBGEMM 2025-05-07T19:51:11.5113377Z + rm -rf /tmp/tmp.HTZzCVKABb 2025-05-07T19:51:11.5113945Z 2025-05-07T19:51:11.9333844Z 2025-05-07T19:51:11.9349295Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:51:11.9350347Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:11.9351044Z 2025-05-07T19:51:12.3489043Z 2025-05-07T19:51:12.3489831Z [INSTALL] Successfully installed cuDNN (for CUDA 11.8.0) 2025-05-07T19:51:12.3554018Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:51:12.3554738Z . 
$PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:51:12.3555441Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:51:12.3555833Z env: 2025-05-07T19:51:12.3556091Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:51:12.3556465Z BUILD_ENV: build_binary 2025-05-07T19:51:12.3556779Z BUILD_TARGET: default 2025-05-07T19:51:12.3557047Z BUILD_VARIANT: cuda 2025-05-07T19:51:12.3557345Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:51:12.3557628Z ##[endgroup] 2025-05-07T19:51:12.8092826Z ################################################################################ 2025-05-07T19:51:12.8093808Z # Prepare FBGEMM-GPU Build 2025-05-07T19:51:12.8094268Z # 2025-05-07T19:51:12.8110470Z # [2025-05-07T19:51:12.810Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:51:12.8111594Z ################################################################################ 2025-05-07T19:51:12.8111842Z 2025-05-07T19:51:12.8123391Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:51:12.8951153Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:51:12.8967991Z [BUILD] Running git submodules update ... 
2025-05-07T19:51:12.8988337Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:51:12.9323948Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:51:12.9324479Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:51:12.9324994Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:51:12.9325457Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:51:12.9325915Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:51:12.9326408Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:51:12.9326862Z Synchronizing submodule url for '../external/json' 2025-05-07T19:51:12.9356163Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:51:12.9788528Z [BUILD] Installing other build dependencies ... 2025-05-07T19:51:12.9810200Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:51:15.0810737Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:51:15.0997305Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:51:15.1065596Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:51:15.2164220Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:51:15.2194836Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:51:15.2245077Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:51:15.2249057Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:51:15.2250742Z Requirement already 
satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:51:15.2252100Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:51:15.2495134Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:51:15.2535344Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:51:15.2579191Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:51:15.2762505Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:51:15.2793979Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:51:15.2841496Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:51:15.2842884Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:51:15.2852382Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:51:15.3025773Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:51:15.3054962Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:51:15.3218062Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:51:15.3246633Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:51:15.3436639Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:51:15.3464149Z 
Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:51:15.3530336Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:51:15.3532432Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:51:15.3576480Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:51:15.3578015Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:51:15.3629236Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:51:15.3728968Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:51:15.3767658Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:51:15.3825598Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:51:15.3836438Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:51:15.3851580Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 
2025-05-07T19:51:15.4092262Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:51:15.4116139Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:51:15.4216358Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:51:15.4316489Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:51:15.5435859Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 254.6 MB/s eta 0:00:00 2025-05-07T19:51:15.5466016Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:51:15.5556692Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:51:15.5625508Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:51:15.5693573Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:51:15.5753184Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:51:15.5828924Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:51:15.5883757Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:51:15.7296079Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:51:16.5663381Z 2025-05-07T19:51:16.5689127Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:51:16.5692521Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. 
Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:51:16.6973281Z ################################################################################ 2025-05-07T19:51:16.6974425Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:51:16.6975261Z # 2025-05-07T19:51:16.6989381Z # [2025-05-07T19:51:16.698Z] + install_triton_pip build_binary 2025-05-07T19:51:16.6990285Z ################################################################################ 2025-05-07T19:51:16.6990555Z 2025-05-07T19:51:16.6990812Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:51:16.6991321Z ################################################################################ 2025-05-07T19:51:16.6991734Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:51:16.6992128Z # 2025-05-07T19:51:16.7010558Z # [2025-05-07T19:51:16.700Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:51:16.7011188Z ################################################################################ 2025-05-07T19:51:16.7011472Z 2025-05-07T19:51:16.7034348Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:51:16.7895946Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:51:16.7896497Z ################################################################################ 2025-05-07T19:51:16.7896893Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:51:16.7897199Z # 2025-05-07T19:51:16.7916799Z # [2025-05-07T19:51:16.791Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:51:16.7918383Z ################################################################################ 2025-05-07T19:51:16.7919109Z 2025-05-07T19:51:16.7965571Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:51:16.7980930Z [INSTALL] Using a non-RELEASE channel: nightly ... 
2025-05-07T19:51:16.7982629Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:51:16.7985049Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:51:16.7992296Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:51:16.8017275Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:51:22.4197509Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:51:22.4201683Z torch 2.8.0.dev20250507+cu118 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:51:22.4203917Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 
2025-05-07T19:51:22.4205452Z 2025-05-07T19:51:22.4205675Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:51:22.4206165Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:51:22.4207082Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:51:22.4208526Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:51:22.4209806Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 158.4 MB/s eta 0:00:00 2025-05-07T19:51:22.4210244Z Installing collected packages: pytorch-triton 2025-05-07T19:51:22.4210624Z Attempting uninstall: pytorch-triton 2025-05-07T19:51:22.4211065Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:51:22.4211541Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:51:22.4211976Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:51:22.4212478Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:51:22.4212752Z 2025-05-07T19:51:24.5230410Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:51:24.5231614Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:51:26.5649526Z ################################################################################ 2025-05-07T19:51:26.5650855Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:51:26.5651394Z ################################################################################ 2025-05-07T19:51:26.5651642Z 2025-05-07T19:51:28.5537200Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:51:30.6040164Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:51:30.6040665Z [BUILD] Successfully ran git submodules update 2025-05-07T19:51:30.6112292Z ##[group]Run . 
$PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:51:30.6113337Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:51:30.6113976Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:51:30.6114323Z env: 2025-05-07T19:51:30.6114553Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:51:30.6114884Z BUILD_ENV: build_binary 2025-05-07T19:51:30.6115153Z BUILD_TARGET: default 2025-05-07T19:51:30.6115392Z BUILD_VARIANT: cuda 2025-05-07T19:51:30.6115651Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:51:30.6115904Z ##[endgroup] 2025-05-07T19:51:31.0596265Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:51:31.0597340Z [BUILD] Extracted build target: default 2025-05-07T19:51:31.0598937Z [BUILD] Extracted build variant: cuda 2025-05-07T19:51:32.9420131Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:51:32.9420856Z 2025-05-07T19:51:33.0176249Z [CHECK] Binary cc found in PATH 2025-05-07T19:51:34.8492780Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:51:34.8493107Z 2025-05-07T19:51:34.9198317Z [CHECK] Binary gcc found in PATH 2025-05-07T19:51:36.7530546Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:51:36.7531149Z 2025-05-07T19:51:36.8260786Z [CHECK] Binary c++ found in PATH 2025-05-07T19:51:38.6745902Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:51:38.6746398Z 2025-05-07T19:51:38.7325048Z [CHECK] Binary g++ found in PATH 2025-05-07T19:51:40.6124657Z [BUILD] Extracted and set Python tag: py311 2025-05-07T19:51:40.6126090Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:51:40.6345346Z core = 24 2025-05-07T19:51:40.6558531Z sockets = 2 2025-05-07T19:51:40.6559524Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:51:40.6560130Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:51:40.6560457Z [BUILD] Running pre-build cleanups ... 
2025-05-07T19:51:40.6560822Z + rm -rf dist 2025-05-07T19:51:40.6560961Z 2025-05-07T19:51:40.6576036Z 2025-05-07T19:51:40.6576995Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:51:40.6577395Z 2025-05-07T19:51:43.7532292Z INFO:root:running clean 2025-05-07T19:51:43.7532641Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:51:43.7533769Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:51:43.7534881Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:51:43.7535388Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:51:43.7535989Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:51:43.7536612Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:51:43.7537270Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:51:43.7537683Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:51:43.7538994Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:51:44.0681939Z 2025-05-07T19:51:44.0682494Z [BUILD] Printing git status ... 2025-05-07T19:51:44.0682873Z + git status 2025-05-07T19:51:44.0683007Z 2025-05-07T19:51:44.5155267Z HEAD detached at pull/4066/merge 2025-05-07T19:51:44.5156196Z Untracked files: 2025-05-07T19:51:44.5157095Z (use "git add <file>..."
to include in what will be committed) 2025-05-07T19:51:44.5158238Z ../build_only/ 2025-05-07T19:51:44.5158888Z ../collect_env.py 2025-05-07T19:51:44.5159594Z fbgemm_gpu/docs/version.py 2025-05-07T19:51:44.5160109Z 2025-05-07T19:51:44.5161367Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:51:44.5161861Z 2025-05-07T19:51:44.5161951Z + git diff 2025-05-07T19:51:44.5162092Z 2025-05-07T19:51:44.5443313Z 2025-05-07T19:51:44.5443626Z ################################################################################ 2025-05-07T19:51:44.5444195Z # Configure FBGEMM-GPU Build 2025-05-07T19:51:44.5444509Z # 2025-05-07T19:51:44.5464690Z # [2025-05-07T19:51:44.545Z] + __configure_fbgemm_gpu_build 2025-05-07T19:51:44.5465615Z ################################################################################ 2025-05-07T19:51:44.5465945Z 2025-05-07T19:51:44.5477135Z [BUILD] Setting the build target: default ... 2025-05-07T19:51:44.5478531Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 
2025-05-07T19:51:46.4649123Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:51:46.4649991Z 2025-05-07T19:51:46.5287909Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:51:48.4258096Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:51:48.4258460Z 2025-05-07T19:51:48.5009818Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:51:50.4084209Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:50.4084496Z 2025-05-07T19:51:50.4838785Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:51:52.3927483Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:52.3927876Z 2025-05-07T19:51:52.4685635Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:51:54.4593046Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:51:54.4593639Z Copyright (c) 2005-2022 NVIDIA Corporation 2025-05-07T19:51:54.4594016Z Built on Wed_Sep_21_10:33:58_PDT_2022 2025-05-07T19:51:54.4594348Z Cuda compilation tools, release 11.8, V11.8.89 2025-05-07T19:51:54.4594752Z Build cuda_11.8.r11.8/compiler.31833905_0 ... 2025-05-07T19:51:54.4595138Z [BUILD] Setting the following CUDA targets: 7.0;8.0 2025-05-07T19:51:54.4595495Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:51:56.3488322Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:52:00.3475214Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:52:00.3476469Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:52:00.3477335Z 2025-05-07T19:52:00.7638355Z 2025-05-07T19:52:00.7639380Z [BUILD] Setting CUDA build args ... 2025-05-07T19:52:00.7644571Z [BUILD] Looking up CUDA version ... 
2025-05-07T19:52:04.6870847Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:52:04.6871237Z 2025-05-07T19:52:06.5755207Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:52:06.5756185Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:52:06.5756680Z 2025-05-07T19:52:06.5756813Z [BUILD] Setting NVCC flags ... 2025-05-07T19:52:06.5757792Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++17 -Xcompiler -std=c++17 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:52:06.5758662Z 2025-05-07T19:52:06.9850575Z 2025-05-07T19:52:06.9851069Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:52:08.7996378Z 2025-05-07T19:52:08.7998419Z -std=c++17 -Xcompiler -std=c++17 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:52:08.7999914Z 2025-05-07T19:52:08.8755936Z 2025-05-07T19:52:08.8756439Z [BUILD] Setting CUDA build args ... 
2025-05-07T19:52:08.8757462Z + conda run -n build_binary c++ --version 2025-05-07T19:52:08.8758157Z 2025-05-07T19:52:10.7668274Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:52:10.7669240Z Target: x86_64-conda-linux-gnu 2025-05-07T19:52:10.7669543Z Thread model: posix 2025-05-07T19:52:10.7669869Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:52:10.7670532Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:52:10.7671008Z 2025-05-07T19:52:10.8414370Z 2025-05-07T19:52:10.8414909Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:52:10.8415237Z 2025-05-07T19:52:12.7324564Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:52:12.7325511Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:52:12.7326004Z 2025-05-07T19:52:12.7326221Z [BUILD] Clang is available; configuring for Clang-based build ... 2025-05-07T19:52:14.6038935Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:52:14.6039552Z [BUILD] Enabling debug features in the build ... 
2025-05-07T19:52:14.6041521Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0' -DCMAKE_CXX_STANDARD=17 --cxxprefix=/github/home/miniconda/envs/build_binary --debug 2025-05-07T19:52:14.6043488Z ################################################################################ 2025-05-07T19:52:14.6043853Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:52:14.6060795Z # 2025-05-07T19:52:14.6061258Z # [2025-05-07T19:52:14.605Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:52:14.6061780Z ################################################################################ 2025-05-07T19:52:14.6062058Z 2025-05-07T19:52:14.6062272Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 
2025-05-07T19:52:14.6066409Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0' --config-setting=--build-option=-DCMAKE_CXX_STANDARD=17 --config-setting=--build-option=--cxxprefix=/github/home/miniconda/envs/build_binary --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py311 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:52:14.6070305Z 2025-05-07T19:52:16.4643440Z * Getting build dependencies for wheel... 
2025-05-07T19:52:17.7155073Z INFO:root:running egg_info 2025-05-07T19:52:17.7179683Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:52:17.7180640Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:52:17.7181504Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:52:17.7183076Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:52:17.7183908Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:52:17.7184970Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:52:17.7235977Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:52:17.7251616Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:52:17.7252675Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:52:17.7254080Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:52:17.7255194Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:52:17.7255701Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:52:17.7256277Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:52:17.7256884Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:52:17.7257454Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:52:17.7257876Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 
2025-05-07T19:52:17.7259176Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:52:18.0564807Z * Building wheel... 2025-05-07T19:52:19.2856297Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-7vekhhi3', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17', '--cxxprefix=/github/home/miniconda/envs/build_binary', '--debug', '--package_channel=nightly', '--python-tag=py311', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:52:19.2860306Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix='/github/home/miniconda/envs/build_binary') 2025-05-07T19:52:19.2862787Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-7vekhhi3', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17', '--python-tag=py311', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:52:19.2863882Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 
2025-05-07T19:52:19.2864490Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:52:19.2865072Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:52:19.2865661Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:52:19.2866105Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:52:19.2871836Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc', '-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17'] 2025-05-07T19:52:19.2877896Z 2025-05-07T19:52:19.2877902Z 2025-05-07T19:52:19.2878091Z -------------------------------------------------------------------------------- 2025-05-07T19:52:19.2878536Z -- Trying 'Ninja' generator 2025-05-07T19:52:19.2878827Z -------------------------------- 
2025-05-07T19:52:19.2879152Z --------------------------- 2025-05-07T19:52:19.2879533Z ---------------------- 2025-05-07T19:52:19.2879794Z ----------------- 2025-05-07T19:52:19.2880028Z ------------ 2025-05-07T19:52:19.2880273Z ------- 2025-05-07T19:52:19.2880492Z -- 2025-05-07T19:52:19.3330789Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:52:19.3332073Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:52:19.3332528Z CMake. 2025-05-07T19:52:19.3332680Z 2025-05-07T19:52:19.3332978Z Update the VERSION argument <min> value. Or, use the <min>...<max> syntax 2025-05-07T19:52:19.3333568Z to tell CMake that the project requires at least <min> but has been updated 2025-05-07T19:52:19.3334117Z to work with policies introduced by <max> or earlier. 2025-05-07T19:52:19.3334387Z 2025-05-07T19:52:19.3334391Z 2025-05-07T19:52:19.3334595Z Not searching for unused variables given on the command line. 2025-05-07T19:52:19.4162079Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:52:19.4257221Z -- Detecting C compiler ABI info 2025-05-07T19:52:19.5435192Z -- Detecting C compiler ABI info - done 2025-05-07T19:52:19.5563632Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:52:19.5565217Z -- Detecting C compile features 2025-05-07T19:52:19.5568791Z -- Detecting C compile features - done 2025-05-07T19:52:19.6931921Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:52:19.7000817Z -- Detecting CXX compiler ABI info 2025-05-07T19:52:19.8243668Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:52:19.8377678Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:52:19.8379272Z -- Detecting CXX compile features 2025-05-07T19:52:19.8385412Z -- Detecting CXX compile features - done 2025-05-07T19:52:19.8400486Z -- Configuring done (0.6s) 2025-05-07T19:52:19.8450748Z -- Generating done (0.0s) 
2025-05-07T19:52:19.8465698Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:52:19.8502596Z -- 2025-05-07T19:52:19.8502985Z ------- 2025-05-07T19:52:19.8503205Z ------------ 2025-05-07T19:52:19.8503455Z ----------------- 2025-05-07T19:52:19.8503683Z ---------------------- 2025-05-07T19:52:19.8503973Z --------------------------- 2025-05-07T19:52:19.8504256Z -------------------------------- 2025-05-07T19:52:19.8504554Z -- Trying 'Ninja' generator - success 2025-05-07T19:52:19.8504960Z -------------------------------------------------------------------------------- 2025-05-07T19:52:19.8505253Z 2025-05-07T19:52:19.8516292Z Configuring Project 2025-05-07T19:52:19.8516578Z Working directory: 2025-05-07T19:52:19.8516957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build 2025-05-07T19:52:19.8517397Z Command: 2025-05-07T19:52:19.8536342Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install -DPYTHON_VERSION_STRING:STRING=3.11.11 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.11 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.11.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.11 
-DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.11 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 -DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc 
-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++ '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0' -DCMAKE_CXX_STANDARD=17 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0' -DCMAKE_CXX_STANDARD=17 -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release 2025-05-07T19:52:19.8555149Z 2025-05-07T19:52:19.8932365Z 2025-05-07T19:52:19.8932740Z Not searching for unused variables given on the command line. 
2025-05-07T19:52:19.8933075Z 2025-05-07T19:52:19.8934050Z ================================================================================ 2025-05-07T19:52:19.8934488Z Default C compiler flags 2025-05-07T19:52:19.8934879Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:52:19.8935196Z 2025-05-07T19:52:19.8936198Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:52:19.8937255Z ================================================================================ 2025-05-07T19:52:19.8937506Z 2025-05-07T19:52:19.8937510Z 2025-05-07T19:52:19.8937514Z 2025-05-07T19:52:19.8937740Z ================================================================================ 2025-05-07T19:52:19.8938079Z Default C++ compiler flags 2025-05-07T19:52:19.8938534Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:52:19.8938824Z 2025-05-07T19:52:19.8939648Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:52:19.8940646Z ================================================================================ 2025-05-07T19:52:19.8940889Z 2025-05-07T19:52:19.8940893Z 2025-05-07T19:52:19.8940896Z 2025-05-07T19:52:19.8941008Z ================================================================================ 2025-05-07T19:52:19.8941328Z AVX2_FLAGS: 2025-05-07T19:52:19.8941444Z 2025-05-07T19:52:19.8941563Z -mavx2 2025-05-07T19:52:19.8941769Z -mf16c 2025-05-07T19:52:19.8941952Z -mfma 2025-05-07T19:52:19.8942161Z -fopenmp 2025-05-07T19:52:19.8942401Z ================================================================================ 2025-05-07T19:52:19.8942618Z 2025-05-07T19:52:19.8942627Z 
2025-05-07T19:52:19.8942631Z 2025-05-07T19:52:19.8942742Z ================================================================================ 2025-05-07T19:52:19.8943057Z AVX512_FLAGS: 2025-05-07T19:52:19.8943180Z 2025-05-07T19:52:19.8943261Z -mavx2 2025-05-07T19:52:19.8943462Z -mf16c 2025-05-07T19:52:19.8943642Z -mfma 2025-05-07T19:52:19.8943845Z -mavx512f 2025-05-07T19:52:19.8944036Z -mavx512bw 2025-05-07T19:52:19.8944247Z -mavx512dq 2025-05-07T19:52:19.8944452Z -mavx512vl 2025-05-07T19:52:19.8944642Z -fopenmp 2025-05-07T19:52:19.8944876Z ================================================================================ 2025-05-07T19:52:19.8945089Z 2025-05-07T19:52:19.8945093Z 2025-05-07T19:52:19.8945096Z 2025-05-07T19:52:19.8945203Z ================================================================================ 2025-05-07T19:52:19.8945544Z The project is built using scikit-build 2025-05-07T19:52:19.8945852Z ================================================================================ 2025-05-07T19:52:19.8946089Z 2025-05-07T19:52:19.8946094Z 2025-05-07T19:52:19.8946097Z 2025-05-07T19:52:19.8946206Z ================================================================================ 2025-05-07T19:52:19.8946520Z Build Settings 2025-05-07T19:52:19.8946644Z 2025-05-07T19:52:19.8946745Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:52:19.8947038Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:52:19.8947207Z 2025-05-07T19:52:19.8947300Z NVCC_VERBOSE : 2025-05-07T19:52:19.8947561Z CUDNN_INCLUDE_DIR : 2025-05-07T19:52:19.8947818Z CUDNN_LIBRARY : 2025-05-07T19:52:19.8948223Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:19.8948699Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:52:19.8948945Z 8.0 2025-05-07T19:52:19.8949048Z 2025-05-07T19:52:19.8949157Z HIP_ROOT_DIR : 2025-05-07T19:52:19.8949399Z HIPCC_VERBOSE : 2025-05-07T19:52:19.8949658Z AMDGPU_TARGETS : 2025-05-07T19:52:19.8949912Z PYTORCH_ROCM_ARCH : 2025-05-07T19:52:19.8950201Z 
================================================================================ 2025-05-07T19:52:19.8950422Z 2025-05-07T19:52:20.0317140Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:52:20.1005319Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:52:21.0430186Z -- The CUDA compiler identification is NVIDIA 11.8.89 with host compiler Clang 16.0.6 2025-05-07T19:52:21.0540075Z -- Detecting CXX compiler ABI info 2025-05-07T19:52:21.1793434Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:52:21.1923766Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:52:21.1925601Z -- Detecting CXX compile features 2025-05-07T19:52:21.1934651Z -- Detecting CXX compile features - done 2025-05-07T19:52:21.2011322Z -- Detecting C compiler ABI info 2025-05-07T19:52:21.3191655Z -- Detecting C compiler ABI info - done 2025-05-07T19:52:21.3317030Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:52:21.3319080Z -- Detecting C compile features 2025-05-07T19:52:21.3323268Z -- Detecting C compile features - done 2025-05-07T19:52:21.3374206Z -- Detecting CUDA compiler ABI info 2025-05-07T19:52:22.2387897Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:52:22.2855100Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:52:22.2875974Z -- Detecting CUDA compile features 2025-05-07T19:52:22.2879862Z -- Detecting CUDA compile features - done 2025-05-07T19:52:22.2904237Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:52:22.5766000Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:52:22.5767063Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:52:22.8998941Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:52:22.9000763Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:52:23.1830070Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:52:23.1830598Z -- Performing Test 
C_HAS_AVX2_2 2025-05-07T19:52:23.5063649Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:52:23.5064041Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:52:23.7891683Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:52:23.7892741Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:52:24.1168511Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:52:24.1169489Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:52:24.3994916Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:52:24.7248554Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:52:24.7249544Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:52:24.7250542Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:52:25.0093364Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:52:25.0094428Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:52:25.3345414Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:52:25.3346637Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:52:25.6196537Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:52:25.6197635Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:52:25.9491358Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:52:25.9676542Z -- Found CUDA: /github/home/miniconda/envs/build_binary (found version "11.8") 2025-05-07T19:52:25.9716146Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/include (found version "11.8.89") 2025-05-07T19:52:25.9792820Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:52:26.0995599Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success 2025-05-07T19:52:26.1003031Z -- Found Threads: TRUE 2025-05-07T19:52:26.1790619Z -- PyTorch: CUDA detected: 11.8 2025-05-07T19:52:26.1791269Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:52:26.1791873Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary 2025-05-07T19:52:26.3227545Z -- PyTorch: Header version is: 11.8 2025-05-07T19:52:26.4022356Z -- Found Python: 
/github/home/miniconda/envs/build_binary/bin/python (found version "3.11.11") found components: Interpreter 2025-05-07T19:52:26.4037664Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:52:26.4038548Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:52:26.4038906Z Call Stack (most recent call first): 2025-05-07T19:52:26.4039640Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:52:26.4040780Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:52:26.4041678Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:52:26.4042274Z CMakeLists.txt:112 (include) 2025-05-07T19:52:26.4042460Z 2025-05-07T19:52:26.4042465Z 2025-05-07T19:52:26.4042635Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:52:26.4043347Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:52:26.4043792Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:52:26.4044219Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:52:26.4044798Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80 2025-05-07T19:52:26.4370466Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:52:26.4372967Z static library kineto_LIBRARY-NOTFOUND not found. 
2025-05-07T19:52:26.4374063Z Call Stack (most recent call first): 2025-05-07T19:52:26.4376411Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:52:26.4379221Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:52:26.4380540Z CMakeLists.txt:112 (include) 2025-05-07T19:52:26.4381106Z 2025-05-07T19:52:26.4381120Z 2025-05-07T19:52:26.4381204Z 2025-05-07T19:52:26.4381215Z 2025-05-07T19:52:26.4381564Z ================================================================================ 2025-05-07T19:52:26.4382332Z PyTorch Flags: 2025-05-07T19:52:26.4382564Z 2025-05-07T19:52:26.4382791Z TORCH_INCLUDE_DIRS: 2025-05-07T19:52:26.4383231Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:26.4384058Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:26.4384661Z 2025-05-07T19:52:26.4384880Z TORCH_LIBRARIES: 2025-05-07T19:52:26.4385109Z torch 2025-05-07T19:52:26.4385326Z torch_library 2025-05-07T19:52:26.4385995Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:26.4386632Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:26.4387278Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:26.4387824Z 2025-05-07T19:52:26.4388047Z TORCH_CUDA_OPTIONS: 2025-05-07T19:52:26.4388304Z --expt-relaxed-constexpr 2025-05-07T19:52:26.4388606Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:26.4388903Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:26.4389228Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:26.4389539Z ================================================================================ 2025-05-07T19:52:26.4389805Z 2025-05-07T19:52:26.4389809Z 2025-05-07T19:52:26.4389814Z 
2025-05-07T19:52:26.4389937Z ================================================================================ 2025-05-07T19:52:26.4390290Z NCCL Flags 2025-05-07T19:52:26.4390419Z 2025-05-07T19:52:26.4390815Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:26.4391776Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:26.4392462Z ================================================================================ 2025-05-07T19:52:26.4392799Z 2025-05-07T19:52:26.4393026Z 2025-05-07T19:52:26.4393031Z 2025-05-07T19:52:26.4393151Z ================================================================================ 2025-05-07T19:52:26.4393508Z CUDA Driver Path 2025-05-07T19:52:26.4393651Z 2025-05-07T19:52:26.4393933Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:26.4394467Z ================================================================================ 2025-05-07T19:52:26.4394700Z 2025-05-07T19:52:26.4395111Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so 2025-05-07T19:52:26.4395904Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:26.4409476Z 2025-05-07T19:52:26.4409577Z 2025-05-07T19:52:26.4410096Z ================================================================================ 2025-05-07T19:52:26.4411739Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:52:26.4412228Z 2025-05-07T19:52:26.4412456Z CPU_SRCS: 2025-05-07T19:52:26.4412579Z 2025-05-07T19:52:26.4412732Z 2025-05-07T19:52:26.4412942Z GPU_SRCS: 2025-05-07T19:52:26.4413061Z 2025-05-07T19:52:26.4413146Z 2025-05-07T19:52:26.4413367Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:26.4413517Z 2025-05-07T19:52:26.4413619Z 2025-05-07T19:52:26.4413820Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:26.4413966Z 
2025-05-07T19:52:26.4414066Z 2025-05-07T19:52:26.4414257Z OTHER_SRCS: 2025-05-07T19:52:26.4414685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:52:26.4415330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:52:26.4415970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:52:26.4416608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:52:26.4417275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:52:26.4437605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:52:26.4438400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:52:26.4439024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:52:26.4439656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:52:26.4440265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:52:26.4440902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:52:26.4441527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:52:26.4442160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:52:26.4442791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:52:26.4443402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:52:26.4444047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:52:26.4444655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:52:26.4445290Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:52:26.4445916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:52:26.4446526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:52:26.4447151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:52:26.4447770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:52:26.4448424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:52:26.4449058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:52:26.4449868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:52:26.4450491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:52:26.4451107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:52:26.4451874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:52:26.4452439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:52:26.4453018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:52:26.4453611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:52:26.4454238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:52:26.4454911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:52:26.4455485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:52:26.4456074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:52:26.4456679Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:52:26.4457267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:52:26.4457855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:52:26.4458427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:52:26.4459018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:52:26.4459585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:52:26.4460247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:52:26.4460836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:52:26.4461423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:52:26.4461998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:52:26.4462604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:52:26.4463198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:52:26.4463809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:52:26.4464398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:52:26.4465024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:52:26.4465640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:52:26.4466232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:52:26.4466867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:52:26.4467479Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:52:26.4468080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:52:26.4468664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:52:26.4469269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:52:26.4469875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:52:26.4470461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:52:26.4470915Z 2025-05-07T19:52:26.4471114Z CC_FLAGS: 2025-05-07T19:52:26.4471256Z 2025-05-07T19:52:26.4471343Z 2025-05-07T19:52:26.4471539Z NVCC_FLAGS: 2025-05-07T19:52:26.4471685Z 2025-05-07T19:52:26.4471768Z 2025-05-07T19:52:26.4471981Z HIPCC_FLAGS: 2025-05-07T19:52:26.4472113Z 2025-05-07T19:52:26.4472197Z 2025-05-07T19:52:26.4472489Z INCLUDE_DIRS: 2025-05-07T19:52:26.4472851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:26.4473387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:26.4473787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:26.4474134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:26.4474650Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:26.4475487Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:26.4476175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:26.4476610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:26.4477076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:26.4477570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:26.4478221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:26.4478699Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:26.4479297Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:26.4479832Z 2025-05-07T19:52:26.4480039Z Selected Source Files: 2025-05-07T19:52:26.4480198Z 2025-05-07T19:52:26.4480304Z 2025-05-07T19:52:26.4480511Z HIPified Source Files: 2025-05-07T19:52:26.4480671Z 2025-05-07T19:52:26.4480767Z 2025-05-07T19:52:26.4480966Z Library Dependencies: 2025-05-07T19:52:26.4481232Z torch 2025-05-07T19:52:26.4481434Z torch_library 2025-05-07T19:52:26.4481909Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:26.4482521Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:26.4483166Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:26.4484013Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:26.4484698Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:26.4485250Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:26.4485892Z 2025-05-07T19:52:26.4486116Z Output Library: 2025-05-07T19:52:26.4486343Z asmjit 2025-05-07T19:52:26.4486564Z 2025-05-07T19:52:26.4486762Z Destination Directory: 2025-05-07T19:52:26.4487038Z fbgemm_gpu 2025-05-07T19:52:26.4487291Z ================================================================================ 2025-05-07T19:52:26.4487532Z 2025-05-07T19:52:26.4487536Z 2025-05-07T19:52:26.4487540Z 2025-05-07T19:52:26.4487669Z ================================================================================ 2025-05-07T19:52:26.4488047Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:52:26.4488350Z 2025-05-07T19:52:26.4488555Z CPU_SRCS: 2025-05-07T19:52:26.4488681Z 2025-05-07T19:52:26.4488767Z 2025-05-07T19:52:26.4488982Z GPU_SRCS: 2025-05-07T19:52:26.4489106Z 
2025-05-07T19:52:26.4489210Z 2025-05-07T19:52:26.4489419Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:26.4489567Z 2025-05-07T19:52:26.4489670Z 2025-05-07T19:52:26.4489874Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:26.4490021Z 2025-05-07T19:52:26.4490121Z 2025-05-07T19:52:26.4490314Z OTHER_SRCS: 2025-05-07T19:52:26.4490599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:52:26.4491091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:52:26.4491566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:52:26.4492007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:52:26.4492431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:52:26.4492943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:52:26.4493416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:52:26.4493819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:52:26.4494221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:52:26.4494827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:52:26.4495289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:52:26.4495729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:52:26.4496196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:52:26.4496575Z 2025-05-07T19:52:26.4496794Z CC_FLAGS: 2025-05-07T19:52:26.4496919Z 2025-05-07T19:52:26.4497003Z 2025-05-07T19:52:26.4497224Z NVCC_FLAGS: 2025-05-07T19:52:26.4497349Z 2025-05-07T19:52:26.4497433Z 2025-05-07T19:52:26.4497764Z HIPCC_FLAGS: 2025-05-07T19:52:26.4497887Z 2025-05-07T19:52:26.4497982Z 2025-05-07T19:52:26.4498164Z INCLUDE_DIRS: 2025-05-07T19:52:26.4498410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:26.4498710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:26.4499077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:26.4499380Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:26.4499874Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:26.4500627Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:26.4501266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:26.4501679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:26.4502090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:26.4502557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:26.4503056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:26.4503513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:26.4504045Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:26.4504546Z 2025-05-07T19:52:26.4504756Z Selected Source Files: 2025-05-07T19:52:26.4504905Z 2025-05-07T19:52:26.4504984Z 2025-05-07T19:52:26.4505195Z HIPified Source Files: 2025-05-07T19:52:26.4505345Z 2025-05-07T19:52:26.4505423Z 2025-05-07T19:52:26.4505630Z Library Dependencies: 2025-05-07T19:52:26.4505853Z torch 2025-05-07T19:52:26.4506055Z torch_library 2025-05-07T19:52:26.4506468Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:26.4507046Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:26.4507619Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:26.4508389Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:26.4509032Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:26.4509397Z asmjit 2025-05-07T19:52:26.4509727Z 
/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:26.4510120Z 2025-05-07T19:52:26.4510326Z Output Library: 2025-05-07T19:52:26.4510529Z fbgemm 2025-05-07T19:52:26.4510731Z 2025-05-07T19:52:26.4510920Z Destination Directory: 2025-05-07T19:52:26.4511171Z fbgemm_gpu 2025-05-07T19:52:26.4511416Z ================================================================================ 2025-05-07T19:52:26.4511639Z 2025-05-07T19:52:26.4511642Z 2025-05-07T19:52:26.4511646Z 2025-05-07T19:52:26.4511757Z ================================================================================ 2025-05-07T19:52:26.4512098Z Running code generation script ... 2025-05-07T19:52:26.4512920Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:52:26.4513923Z ================================================================================ 2025-05-07T19:52:26.4514164Z 2025-05-07T19:52:26.9818608Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:26.9820340Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:52:26.9821139Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:52:26.9821641Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:52:26.9822168Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:26.9822697Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:26.9823221Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:52:26.9823949Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:52:26.9824429Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:52:26.9824923Z Written: 
gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:26.9825422Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:26.9826037Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:52:26.9826527Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:26.9827045Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:52:26.9827553Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:26.9828107Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:26.9828638Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:52:26.9829150Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:26.9829674Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:52:26.9830189Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:26.9830752Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:26.9831279Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:52:26.9831782Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:52:26.9832211Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:52:26.9832689Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:52:26.9833324Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:26.9833920Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:26.9834473Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:26.9834977Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:26.9835529Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 
2025-05-07T19:52:26.9836088Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:26.9836613Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:26.9837204Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:26.9837790Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:26.9838364Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:26.9838940Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:26.9839644Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:26.9840163Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:52:26.9840588Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:52:26.9840986Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:52:26.9841422Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:26.9841838Z Written: lookup_adagrad.py 2025-05-07T19:52:26.9842147Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:52:26.9842560Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:52:26.9842991Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:26.9843576Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:52:26.9844052Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:52:26.9844510Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:26.9845011Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:26.9845476Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:52:26.9845954Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:52:26.9846411Z Written: 
gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:52:26.9846900Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:26.9847409Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:26.9848302Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:52:26.9848807Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:26.9849297Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:26.9849815Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:26.9850346Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:26.9850872Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:26.9851391Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:26.9851886Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:26.9852410Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:26.9852948Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:26.9853484Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:26.9853969Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:52:26.9854395Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:52:26.9854773Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:52:26.9855200Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:26.9855605Z Written: lookup_adam.py 2025-05-07T19:52:26.9855897Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:52:26.9856327Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:26.9856770Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 
2025-05-07T19:52:26.9857240Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:26.9857723Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:26.9858168Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:52:26.9858653Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:26.9859136Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:26.9859618Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:26.9860121Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:26.9860653Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:26.9861159Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:26.9861667Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:26.9862207Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:26.9862684Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:52:26.9863104Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:52:26.9863463Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:52:26.9863927Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:26.9864317Z Written: lookup_lamb.py 2025-05-07T19:52:26.9864744Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:52:26.9865187Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:26.9865696Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:52:26.9866206Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:26.9866759Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:26.9867251Z Written: 
gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:52:26.9867801Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:26.9868359Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:26.9868870Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:26.9869436Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:26.9870043Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:26.9870589Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:26.9871194Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:26.9871767Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:26.9872310Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:52:26.9872844Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:52:26.9873487Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:52:26.9873979Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:26.9874451Z Written: lookup_lars_sgd.py 2025-05-07T19:52:26.9874843Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:52:26.9875329Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:26.9875914Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:52:26.9876534Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:26.9877208Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:52:26.9877824Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:52:26.9878491Z Written: 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:26.9879175Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:52:26.9879815Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:26.9880526Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:26.9881226Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:26.9881922Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:26.9882622Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:26.9883348Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.0704335Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:52:27.0706052Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:52:27.0707674Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:52:27.0709387Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.0710670Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:52:27.0711117Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:52:27.0711671Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.0712300Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:52:27.0713493Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.0714202Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:27.0714822Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 
2025-05-07T19:52:27.0715456Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.0716117Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:27.0716748Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.0717445Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.0718148Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.0718916Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.0719712Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.0720359Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.0720973Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:52:27.0721515Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:52:27.0721988Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:52:27.0722532Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.0722990Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:52:27.0723398Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:52:27.0723920Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.0724493Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:52:27.0725044Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:52:27.0725563Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:52:27.0726084Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 
2025-05-07T19:52:27.0726616Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.0727195Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.0727745Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:52:27.0728305Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:27.0728858Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:52:27.0729376Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:27.0729928Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:52:27.0730470Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:52:27.0731012Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:52:27.0731520Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:27.0732076Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.0732660Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.0733221Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:52:27.0733783Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:27.0734325Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:52:27.0734869Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:27.0735438Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.0736130Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.0736717Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 
2025-05-07T19:52:27.0737263Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.0737875Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.0738469Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.0739067Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.0739644Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.0740227Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:52:27.0740784Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.0741403Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.0741996Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.0742565Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:52:27.0743120Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.0743702Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.0744307Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.0744923Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.0745515Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.0746104Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:52:27.0746668Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.0747280Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 
2025-05-07T19:52:27.0747886Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:52:27.0748481Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:27.0749103Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:52:27.0749703Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:27.0750312Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:52:27.0750928Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:27.0751543Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:52:27.0752135Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:52:27.0752811Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:52:27.0753654Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:52:27.0754266Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:52:27.0754844Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:52:27.0755407Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:52:27.0755910Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:52:27.0756357Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:27.0756866Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.0757332Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:52:27.0757718Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:52:27.0758180Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:27.0758802Z Written: 
gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.0759374Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:52:27.0759737Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:52:27.0760177Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:27.0760665Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.0761213Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:52:27.0761742Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:52:27.0762220Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:27.0762747Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.0763345Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:27.0763872Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.0764498Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:52:27.0765088Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:27.0765643Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:27.0766268Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.0766883Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:27.0767503Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.0768180Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:52:27.0768833Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:27.0769439Z Written: 
gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:27.0770097Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.0770767Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:27.1755441Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.1757757Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:52:27.1759735Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:52:27.1761740Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.1762443Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:27.1763132Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:52:27.1764035Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:52:27.1764656Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:52:27.1765280Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.1765935Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:27.1766560Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:52:27.1767205Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.1767859Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.1768536Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:52:27.1769496Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.1770180Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.1770857Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.1771517Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.1772206Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.1772911Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.1773584Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.1774227Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:52:27.1774878Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:27.1775419Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:27.1776007Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.1776500Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:52:27.1776943Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:27.1777513Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.1778160Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:52:27.1778772Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:27.1779343Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:27.1779974Z Written: 
gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.1780603Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:27.1781237Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.1781868Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:52:27.1782407Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:52:27.1782892Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:52:27.1783443Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.1784007Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:52:27.1784549Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.1785075Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:52:27.1785503Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:52:27.1786142Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.1786614Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:27.1787050Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:27.1787499Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:52:27.1787926Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:52:27.1788377Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.1788842Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:27.1789302Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:27.1789761Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 
2025-05-07T19:52:27.1790242Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.1790742Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.1791333Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.1791845Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.1792324Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.1792909Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.1793625Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.1794200Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.1794738Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.1795228Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:52:27.1795661Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:52:27.1796108Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:52:27.1796561Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.1796966Z Written: lookup_sgd.py 2025-05-07T19:52:27.1797276Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:52:27.1797695Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:52:27.1798136Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.1798660Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:52:27.1799140Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:52:27.1799584Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:52:27.1800075Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.1800589Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 
2025-05-07T19:52:27.1801097Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.1801598Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:52:27.1802109Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.1802617Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:52:27.1803109Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:52:27.1803610Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.1804140Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:52:27.1804660Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.1805202Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.1805857Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.1806342Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.1806865Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.1807396Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.1807899Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:52:27.1808329Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:52:27.1808696Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:52:27.1809131Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.1809510Z Written: lookup_none.py 2025-05-07T19:52:27.1809809Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:52:27.1810224Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.1810717Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:52:27.1811242Z Written: 
gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:52:27.1811801Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:52:27.1812316Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:52:27.1812808Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:52:27.1813362Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:52:27.1813828Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:52:27.1814332Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:52:27.1814855Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:52:27.1815393Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:52:27.1815916Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:52:27.1816407Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:52:27.1816900Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:52:27.1817367Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:52:27.1817887Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:52:27.1818349Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:52:27.1818854Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:52:27.1819354Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:52:27.1819749Z Written: pt2_arg_utils.h 2025-05-07T19:52:27.1820015Z Written: __init__.py 2025-05-07T19:52:27.1820263Z Written: lookup_args_ssd.py 2025-05-07T19:52:27.1820539Z Written: lookup_args.py 2025-05-07T19:52:27.1867663Z 2025-05-07T19:52:27.1867775Z 2025-05-07T19:52:27.1868304Z ================================================================================ 
2025-05-07T19:52:27.1869444Z Running code generation script ... 2025-05-07T19:52:27.1871841Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:52:27.1873531Z ================================================================================ 2025-05-07T19:52:27.1873811Z 2025-05-07T19:52:27.2875722Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:27.2878368Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:52:27.2880675Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:52:27.2881188Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:52:27.2881663Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:27.2882182Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:52:27.2882782Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:52:27.2883150Z Written: optimizer_args.py 2025-05-07T19:52:27.2962876Z 2025-05-07T19:52:27.2962895Z 2025-05-07T19:52:27.2963524Z ================================================================================ 2025-05-07T19:52:27.2964652Z Running code generation script ... 
2025-05-07T19:52:27.2967029Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:52:27.2969420Z ================================================================================ 2025-05-07T19:52:27.2970178Z 2025-05-07T19:52:27.4103463Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:27.4106161Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:52:27.4108791Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:52:27.4110856Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:52:27.4112159Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:52:27.4112939Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:52:27.4114122Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:52:27.4114820Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:52:27.4115568Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:52:27.4116335Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:52:27.4117113Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:52:27.4117891Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:52:27.4118651Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:52:27.4119629Z Written: 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:52:27.4120314Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:52:27.4121000Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:52:27.4121682Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:52:27.4122344Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:52:27.4123026Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:52:27.4123688Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:52:27.4124344Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:52:27.4124999Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:52:27.4125636Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:52:27.4126216Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:27.4126706Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:27.4193962Z 2025-05-07T19:52:27.4194552Z 2025-05-07T19:52:27.4195026Z ================================================================================ 2025-05-07T19:52:27.4196107Z Running code generation script ... 
2025-05-07T19:52:27.4198389Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:52:27.4200711Z ================================================================================ 2025-05-07T19:52:27.4201394Z 2025-05-07T19:52:27.7602145Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:27.7603771Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:52:27.7604551Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:27.7605072Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:52:27.7605581Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:27.7606114Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:52:27.7606624Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:27.7607124Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:27.7607624Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:52:27.7608094Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:52:27.7608709Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:27.7609315Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:27.7609801Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:52:27.7610498Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:52:27.7610988Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:27.7611492Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:52:27.7611984Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 
2025-05-07T19:52:27.7612502Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:52:27.7612989Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:27.7613472Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:27.7613972Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:27.7614453Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:27.7614933Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:27.7615488Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:27.7615975Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:27.7616422Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:27.7616908Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:27.7617411Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:27.7617883Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:27.7618361Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:27.7618810Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:52:27.7619245Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:52:27.7619672Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:52:27.7620141Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:52:27.7620598Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:52:27.7621017Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:52:27.7621457Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:52:27.7621872Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:52:27.7622284Z Written: 
gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:52:27.7622704Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:52:27.7623176Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:52:27.7623639Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:52:27.7624076Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:52:27.7624515Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:52:27.7624923Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:52:27.7625378Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:52:27.7625819Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:52:27.7626292Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:52:27.7626767Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:52:27.7627198Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:52:27.7627640Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:52:27.7628108Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:27.7628632Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:27.7629128Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:27.7629640Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:27.7630108Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.7630550Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.7630980Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.7714563Z 2025-05-07T19:52:27.7714620Z 2025-05-07T19:52:27.7715137Z ================================================================================ 2025-05-07T19:52:27.7716254Z Running code 
generation script ... 2025-05-07T19:52:27.7718546Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:52:27.7720057Z ================================================================================ 2025-05-07T19:52:27.7720296Z 2025-05-07T19:52:28.0289073Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:28.0393005Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:52:28.0395189Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:52:28.0396944Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:52:28.0398244Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:52:28.0399617Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:52:28.0400954Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:52:28.0401716Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:52:28.0402227Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:52:28.0402797Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:52:28.0403467Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:52:28.0403893Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:52:28.0404172Z 2025-05-07T19:52:28.0404177Z 2025-05-07T19:52:28.0404300Z ================================================================================ 2025-05-07T19:52:28.0404722Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:52:28.0405069Z 2025-05-07T19:52:28.0405277Z CPU_SRCS: 2025-05-07T19:52:28.0405684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:52:28.0406384Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:52:28.0407053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:52:28.0407754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:52:28.0408375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:52:28.0408873Z 2025-05-07T19:52:28.0409085Z GPU_SRCS: 2025-05-07T19:52:28.0409444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:52:28.0410064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:52:28.0410700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:52:28.0411380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:52:28.0411999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:52:28.0412613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:52:28.0413264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:52:28.0413973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:52:28.0414563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:52:28.0415204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:52:28.0415689Z 2025-05-07T19:52:28.0415884Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.0416040Z 2025-05-07T19:52:28.0416119Z 2025-05-07T19:52:28.0416326Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.0416467Z 2025-05-07T19:52:28.0416548Z 2025-05-07T19:52:28.0416751Z OTHER_SRCS: 2025-05-07T19:52:28.0416868Z 2025-05-07T19:52:28.0416945Z 2025-05-07T19:52:28.0417254Z CC_FLAGS: 2025-05-07T19:52:28.0417372Z 
2025-05-07T19:52:28.0417450Z 2025-05-07T19:52:28.0417652Z NVCC_FLAGS: 2025-05-07T19:52:28.0417867Z --expt-relaxed-constexpr 2025-05-07T19:52:28.0418157Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.0418439Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.0418739Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.0419007Z 2025-05-07T19:52:28.0419205Z HIPCC_FLAGS: 2025-05-07T19:52:28.0419339Z 2025-05-07T19:52:28.0419413Z 2025-05-07T19:52:28.0419588Z INCLUDE_DIRS: 2025-05-07T19:52:28.0419829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.0420139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.0420429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.0420753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.0421241Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.0422100Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.0422739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.0423162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.0423587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.0424068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.0424602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.0425058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.0425630Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.0426130Z 2025-05-07T19:52:28.0426338Z Selected Source Files: 2025-05-07T19:52:28.0426763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:52:28.0427428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:52:28.0428076Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:52:28.0428683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:52:28.0429300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:52:28.0429924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:52:28.0430519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:52:28.0431143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:52:28.0431791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:52:28.0432405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:52:28.0433279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:52:28.0433937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:52:28.0434541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:52:28.0435154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:52:28.0435822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:52:28.0436323Z 2025-05-07T19:52:28.0436547Z HIPified Source Files: 2025-05-07T19:52:28.0436707Z 2025-05-07T19:52:28.0436793Z 2025-05-07T19:52:28.0437027Z Library Dependencies: 2025-05-07T19:52:28.0437271Z torch 2025-05-07T19:52:28.0437487Z torch_library 2025-05-07T19:52:28.0437937Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.0438565Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.0439303Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 
2025-05-07T19:52:28.0440133Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.0440775Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.0441267Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.0441668Z 2025-05-07T19:52:28.0441854Z Output Library: 2025-05-07T19:52:28.0442087Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:28.0442303Z 2025-05-07T19:52:28.0442508Z Destination Directory: 2025-05-07T19:52:28.0442731Z fbgemm_gpu 2025-05-07T19:52:28.0442962Z ================================================================================ 2025-05-07T19:52:28.0443217Z 2025-05-07T19:52:28.0875951Z 2025-05-07T19:52:28.0875970Z 2025-05-07T19:52:28.0876493Z ================================================================================ 2025-05-07T19:52:28.0877790Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:52:28.0879310Z 2025-05-07T19:52:28.0879880Z CPU_SRCS: 2025-05-07T19:52:28.0880761Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:28.0881924Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:28.0882384Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:28.0882770Z 2025-05-07T19:52:28.0882974Z GPU_SRCS: 2025-05-07T19:52:28.0883295Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:52:28.0883796Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:52:28.0884373Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:52:28.0885075Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:52:28.0885955Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:52:28.0886599Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:52:28.0887279Z 
gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:52:28.0887920Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:52:28.0888602Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:52:28.0889292Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:52:28.0890001Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:52:28.0890705Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:52:28.0891389Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:52:28.0892222Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:52:28.0892870Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:52:28.0893518Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:52:28.0894168Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:52:28.0894786Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:52:28.0895429Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:52:28.0896048Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:52:28.0896650Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:52:28.0897239Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:52:28.0897850Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.0898294Z 2025-05-07T19:52:28.0898494Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.0898647Z 
2025-05-07T19:52:28.0898750Z 2025-05-07T19:52:28.0898950Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.0899111Z 2025-05-07T19:52:28.0899196Z 2025-05-07T19:52:28.0900915Z OTHER_SRCS: 2025-05-07T19:52:28.0901072Z 2025-05-07T19:52:28.0901163Z 2025-05-07T19:52:28.0901352Z CC_FLAGS: 2025-05-07T19:52:28.0901494Z 2025-05-07T19:52:28.0901579Z 2025-05-07T19:52:28.0901790Z NVCC_FLAGS: 2025-05-07T19:52:28.0902015Z --expt-relaxed-constexpr 2025-05-07T19:52:28.0902317Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.0902605Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.0902923Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.0903181Z 2025-05-07T19:52:28.0903395Z HIPCC_FLAGS: 2025-05-07T19:52:28.0903526Z 2025-05-07T19:52:28.0903609Z 2025-05-07T19:52:28.0903816Z INCLUDE_DIRS: 2025-05-07T19:52:28.0904060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.0904399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.0904687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.0905018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.0905624Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.0906429Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.0907108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.0907528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.0907978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.0908453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.0908993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.0909470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.0910031Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.0910555Z 
2025-05-07T19:52:28.0910757Z Selected Source Files: 2025-05-07T19:52:28.0911120Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:28.0911585Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:28.0912054Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:28.0912490Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:52:28.0913271Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:52:28.0913864Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:52:28.0914497Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:52:28.0915143Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:52:28.0915760Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:52:28.0916401Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:52:28.0917042Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:52:28.0917706Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:52:28.0918407Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:52:28.0919084Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:52:28.0919885Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:52:28.0920546Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:52:28.0921221Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:52:28.0921883Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:52:28.0922510Z 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:52:28.0923152Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:52:28.0923849Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:52:28.0924494Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:52:28.0925136Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:52:28.0925736Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:52:28.0926352Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:52:28.0926954Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.0927396Z 2025-05-07T19:52:28.0927601Z HIPified Source Files: 2025-05-07T19:52:28.0927778Z 2025-05-07T19:52:28.0927860Z 2025-05-07T19:52:28.0928061Z Library Dependencies: 2025-05-07T19:52:28.0928315Z torch 2025-05-07T19:52:28.0928526Z torch_library 2025-05-07T19:52:28.0929033Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.0929645Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.0930253Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.0931073Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.0931741Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.0932150Z asmjit 2025-05-07T19:52:28.0932369Z fbgemm 2025-05-07T19:52:28.0932577Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:28.0932843Z fbgemm_gpu_config 2025-05-07T19:52:28.0933206Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.0933633Z 2025-05-07T19:52:28.0933836Z Output Library: 
2025-05-07T19:52:28.0934097Z fbgemm_gpu_tbe_inference 2025-05-07T19:52:28.0934347Z 2025-05-07T19:52:28.0934580Z Destination Directory: 2025-05-07T19:52:28.0934829Z fbgemm_gpu 2025-05-07T19:52:28.0935093Z ================================================================================ 2025-05-07T19:52:28.0935330Z 2025-05-07T19:52:28.3114488Z 2025-05-07T19:52:28.3114620Z 2025-05-07T19:52:28.3115108Z ================================================================================ 2025-05-07T19:52:28.3116319Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:52:28.3117287Z 2025-05-07T19:52:28.3117838Z CPU_SRCS: 2025-05-07T19:52:28.3118460Z src/config/feature_gates.cpp 2025-05-07T19:52:28.3119220Z 2025-05-07T19:52:28.3119652Z GPU_SRCS: 2025-05-07T19:52:28.3119792Z 2025-05-07T19:52:28.3119874Z 2025-05-07T19:52:28.3120074Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3120240Z 2025-05-07T19:52:28.3120323Z 2025-05-07T19:52:28.3120641Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3120796Z 2025-05-07T19:52:28.3120878Z 2025-05-07T19:52:28.3121084Z OTHER_SRCS: 2025-05-07T19:52:28.3121205Z 2025-05-07T19:52:28.3121285Z 2025-05-07T19:52:28.3121489Z CC_FLAGS: 2025-05-07T19:52:28.3121610Z 2025-05-07T19:52:28.3201893Z 2025-05-07T19:52:28.3202213Z NVCC_FLAGS: 2025-05-07T19:52:28.3202379Z 2025-05-07T19:52:28.3202464Z 2025-05-07T19:52:28.3202690Z HIPCC_FLAGS: 2025-05-07T19:52:28.3202847Z 2025-05-07T19:52:28.3202927Z 2025-05-07T19:52:28.3203146Z INCLUDE_DIRS: 2025-05-07T19:52:28.3203388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3203725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3204017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3204370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3204882Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3205861Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
2025-05-07T19:52:28.3206563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3206991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3207470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3207944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3208678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3209144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3209722Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3210272Z 2025-05-07T19:52:28.3210501Z Selected Source Files: 2025-05-07T19:52:28.3210788Z src/config/feature_gates.cpp 2025-05-07T19:52:28.3211047Z 2025-05-07T19:52:28.3211256Z HIPified Source Files: 2025-05-07T19:52:28.3211409Z 2025-05-07T19:52:28.3211485Z 2025-05-07T19:52:28.3211690Z Library Dependencies: 2025-05-07T19:52:28.3211914Z torch 2025-05-07T19:52:28.3212126Z torch_library 2025-05-07T19:52:28.3212568Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3213178Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3213901Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3214709Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3215378Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3215902Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3216318Z 2025-05-07T19:52:28.3216507Z Output Library: 2025-05-07T19:52:28.3216745Z fbgemm_gpu_config 2025-05-07T19:52:28.3216958Z 2025-05-07T19:52:28.3217165Z Destination Directory: 2025-05-07T19:52:28.3217403Z fbgemm_gpu 2025-05-07T19:52:28.3217652Z 
================================================================================ 2025-05-07T19:52:28.3217888Z 2025-05-07T19:52:28.3217893Z 2025-05-07T19:52:28.3217897Z 2025-05-07T19:52:28.3218021Z ================================================================================ 2025-05-07T19:52:28.3218409Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:52:28.3218762Z 2025-05-07T19:52:28.3218948Z CPU_SRCS: 2025-05-07T19:52:28.3219258Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:28.3219718Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:28.3220094Z 2025-05-07T19:52:28.3220292Z GPU_SRCS: 2025-05-07T19:52:28.3220562Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:28.3220986Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:52:28.3221374Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:52:28.3221769Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:52:28.3222163Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:52:28.3222523Z 2025-05-07T19:52:28.3222714Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3222867Z 2025-05-07T19:52:28.3222947Z 2025-05-07T19:52:28.3223146Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3223294Z 2025-05-07T19:52:28.3223370Z 2025-05-07T19:52:28.3223565Z OTHER_SRCS: 2025-05-07T19:52:28.3223685Z 2025-05-07T19:52:28.3223763Z 2025-05-07T19:52:28.3223953Z CC_FLAGS: 2025-05-07T19:52:28.3224066Z 2025-05-07T19:52:28.3224140Z 2025-05-07T19:52:28.3224333Z NVCC_FLAGS: 2025-05-07T19:52:28.3224451Z 2025-05-07T19:52:28.3224530Z 2025-05-07T19:52:28.3224724Z HIPCC_FLAGS: 2025-05-07T19:52:28.3224847Z 2025-05-07T19:52:28.3224923Z 2025-05-07T19:52:28.3225116Z INCLUDE_DIRS: 2025-05-07T19:52:28.3225357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3225664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3225958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3226261Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3226763Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3227553Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3228226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3228634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3229167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3229781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3230292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3230760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3231308Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3231819Z 2025-05-07T19:52:28.3232011Z Selected Source Files: 2025-05-07T19:52:28.3232349Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:28.3233119Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:28.3233558Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:28.3233977Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:52:28.3234445Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:52:28.3234830Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:52:28.3235224Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:52:28.3235580Z 2025-05-07T19:52:28.3235776Z HIPified Source Files: 2025-05-07T19:52:28.3235944Z 2025-05-07T19:52:28.3236021Z 2025-05-07T19:52:28.3236233Z Library Dependencies: 2025-05-07T19:52:28.3236462Z torch 2025-05-07T19:52:28.3236667Z torch_library 2025-05-07T19:52:28.3237105Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 
2025-05-07T19:52:28.3237722Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3238329Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3239149Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3239817Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3240354Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3240774Z 2025-05-07T19:52:28.3240963Z Output Library: 2025-05-07T19:52:28.3241211Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:28.3241427Z 2025-05-07T19:52:28.3241617Z Destination Directory: 2025-05-07T19:52:28.3241841Z fbgemm_gpu 2025-05-07T19:52:28.3242062Z ================================================================================ 2025-05-07T19:52:28.3242289Z 2025-05-07T19:52:28.3242293Z 2025-05-07T19:52:28.3242297Z 2025-05-07T19:52:28.3242411Z ================================================================================ 2025-05-07T19:52:28.3242816Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:52:28.3243195Z 2025-05-07T19:52:28.3243390Z CPU_SRCS: 2025-05-07T19:52:28.3243651Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:28.3243953Z 2025-05-07T19:52:28.3244170Z GPU_SRCS: 2025-05-07T19:52:28.3244406Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:52:28.3244692Z 2025-05-07T19:52:28.3244899Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3245050Z 2025-05-07T19:52:28.3245139Z 2025-05-07T19:52:28.3245351Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3245490Z 2025-05-07T19:52:28.3245562Z 2025-05-07T19:52:28.3245736Z OTHER_SRCS: 2025-05-07T19:52:28.3245850Z 2025-05-07T19:52:28.3245926Z 2025-05-07T19:52:28.3246103Z CC_FLAGS: 2025-05-07T19:52:28.3246209Z 2025-05-07T19:52:28.3246282Z 2025-05-07T19:52:28.3246461Z NVCC_FLAGS: 2025-05-07T19:52:28.3246579Z 2025-05-07T19:52:28.3246668Z 
2025-05-07T19:52:28.3246850Z HIPCC_FLAGS: 2025-05-07T19:52:28.3246980Z 2025-05-07T19:52:28.3247060Z 2025-05-07T19:52:28.3247230Z INCLUDE_DIRS: 2025-05-07T19:52:28.3247462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3247774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3248057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3248359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3248866Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3249752Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3250424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3250840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3251264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3251756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3252273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3252748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3253474Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3253986Z 2025-05-07T19:52:28.3254189Z Selected Source Files: 2025-05-07T19:52:28.3254521Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:28.3254853Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:52:28.3255138Z 2025-05-07T19:52:28.3255356Z HIPified Source Files: 2025-05-07T19:52:28.3255516Z 2025-05-07T19:52:28.3255593Z 2025-05-07T19:52:28.3255805Z Library Dependencies: 2025-05-07T19:52:28.3256051Z torch 2025-05-07T19:52:28.3256253Z torch_library 2025-05-07T19:52:28.3256691Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3257293Z 
/github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3257908Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3258714Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3259377Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3259764Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:28.3260132Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3260544Z 2025-05-07T19:52:28.3260746Z Output Library: 2025-05-07T19:52:28.3260990Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:28.3261250Z 2025-05-07T19:52:28.3261455Z Destination Directory: 2025-05-07T19:52:28.3261689Z fbgemm_gpu 2025-05-07T19:52:28.3261925Z ================================================================================ 2025-05-07T19:52:28.3262156Z 2025-05-07T19:52:28.3262160Z 2025-05-07T19:52:28.3262164Z 2025-05-07T19:52:28.3262276Z ================================================================================ 2025-05-07T19:52:28.3262657Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:52:28.3263003Z 2025-05-07T19:52:28.3263178Z CPU_SRCS: 2025-05-07T19:52:28.3263437Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:28.3263860Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:28.3264275Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:28.3264580Z 2025-05-07T19:52:28.3264774Z GPU_SRCS: 2025-05-07T19:52:28.3265001Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:52:28.3265351Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:52:28.3265698Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:28.3266016Z 2025-05-07T19:52:28.3266203Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3266361Z 2025-05-07T19:52:28.3266439Z 2025-05-07T19:52:28.3266636Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3266772Z 
2025-05-07T19:52:28.3266849Z 2025-05-07T19:52:28.3267037Z OTHER_SRCS: 2025-05-07T19:52:28.3267153Z 2025-05-07T19:52:28.3267229Z 2025-05-07T19:52:28.3267417Z CC_FLAGS: 2025-05-07T19:52:28.3267526Z 2025-05-07T19:52:28.3267603Z 2025-05-07T19:52:28.3267785Z NVCC_FLAGS: 2025-05-07T19:52:28.3267995Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3268272Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3268560Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3268848Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3269109Z 2025-05-07T19:52:28.3269288Z HIPCC_FLAGS: 2025-05-07T19:52:28.3269411Z 2025-05-07T19:52:28.3269498Z 2025-05-07T19:52:28.3269679Z INCLUDE_DIRS: 2025-05-07T19:52:28.3270002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3270318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3270614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3270921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3271422Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3272233Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3272983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3273406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3273834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3274321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3274961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3275448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3276033Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3276537Z 2025-05-07T19:52:28.3276736Z Selected Source Files: 2025-05-07T19:52:28.3277027Z 
codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:28.3277457Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:28.3277858Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:28.3278223Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:28.3278575Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:52:28.3278918Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:52:28.3279219Z 2025-05-07T19:52:28.3279412Z HIPified Source Files: 2025-05-07T19:52:28.3279564Z 2025-05-07T19:52:28.3279650Z 2025-05-07T19:52:28.3279840Z Library Dependencies: 2025-05-07T19:52:28.3280071Z torch 2025-05-07T19:52:28.3280254Z torch_library 2025-05-07T19:52:28.3280698Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3281287Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3281905Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3282719Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3283382Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3283765Z fbgemm 2025-05-07T19:52:28.3283960Z fbgemm_gpu_config 2025-05-07T19:52:28.3284316Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3284721Z 2025-05-07T19:52:28.3284917Z Output Library: 2025-05-07T19:52:28.3285137Z fbgemm_gpu_tbe_common 2025-05-07T19:52:28.3285365Z 2025-05-07T19:52:28.3285551Z Destination Directory: 2025-05-07T19:52:28.3285998Z fbgemm_gpu 2025-05-07T19:52:28.3286227Z ================================================================================ 2025-05-07T19:52:28.3286459Z 2025-05-07T19:52:28.3286463Z 2025-05-07T19:52:28.3286468Z 2025-05-07T19:52:28.3286581Z ================================================================================ 
2025-05-07T19:52:28.3286983Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:52:28.3287335Z 2025-05-07T19:52:28.3287528Z CPU_SRCS: 2025-05-07T19:52:28.3287647Z 2025-05-07T19:52:28.3287722Z 2025-05-07T19:52:28.3287909Z GPU_SRCS: 2025-05-07T19:52:28.3288156Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:28.3288561Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:52:28.3288981Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:52:28.3289317Z 2025-05-07T19:52:28.3289518Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3289657Z 2025-05-07T19:52:28.3289735Z 2025-05-07T19:52:28.3289940Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3290081Z 2025-05-07T19:52:28.3290157Z 2025-05-07T19:52:28.3290351Z OTHER_SRCS: 2025-05-07T19:52:28.3290467Z 2025-05-07T19:52:28.3290684Z 2025-05-07T19:52:28.3290873Z CC_FLAGS: 2025-05-07T19:52:28.3290986Z 2025-05-07T19:52:28.3291077Z 2025-05-07T19:52:28.3291255Z NVCC_FLAGS: 2025-05-07T19:52:28.3291478Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3291746Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3292031Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3292320Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3292575Z 2025-05-07T19:52:28.3292757Z HIPCC_FLAGS: 2025-05-07T19:52:28.3292894Z 2025-05-07T19:52:28.3292967Z 2025-05-07T19:52:28.3293144Z INCLUDE_DIRS: 2025-05-07T19:52:28.3293385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3293707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3293987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3294301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3294882Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3295689Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3296347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
2025-05-07T19:52:28.3296775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3297224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3297711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3298239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3298699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3299285Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3299795Z 2025-05-07T19:52:28.3299996Z Selected Source Files: 2025-05-07T19:52:28.3300284Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:28.3300694Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:52:28.3301125Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:52:28.3301462Z 2025-05-07T19:52:28.3301678Z HIPified Source Files: 2025-05-07T19:52:28.3301830Z 2025-05-07T19:52:28.3301908Z 2025-05-07T19:52:28.3302110Z Library Dependencies: 2025-05-07T19:52:28.3302332Z torch 2025-05-07T19:52:28.3302528Z torch_library 2025-05-07T19:52:28.3302962Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3303568Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3304178Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3305244Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3305921Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3306447Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3306863Z 2025-05-07T19:52:28.3307056Z Output Library: 2025-05-07T19:52:28.3307309Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:52:28.3307576Z 2025-05-07T19:52:28.3307777Z 
Destination Directory: 2025-05-07T19:52:28.3308031Z fbgemm_gpu 2025-05-07T19:52:28.3308266Z ================================================================================ 2025-05-07T19:52:28.3308501Z 2025-05-07T19:52:28.3308505Z 2025-05-07T19:52:28.3308519Z 2025-05-07T19:52:28.3308635Z ================================================================================ 2025-05-07T19:52:28.3309054Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:52:28.3309451Z 2025-05-07T19:52:28.3309642Z CPU_SRCS: 2025-05-07T19:52:28.3309898Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3310232Z 2025-05-07T19:52:28.3310421Z GPU_SRCS: 2025-05-07T19:52:28.3310678Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:52:28.3311048Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:52:28.3311418Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:52:28.3311873Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:52:28.3312319Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:52:28.3312820Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:52:28.3313219Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:52:28.3313617Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:52:28.3313988Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:52:28.3314387Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:52:28.3314794Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:28.3315225Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.3315654Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:28.3316086Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:52:28.3316565Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:28.3317002Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.3317436Z 
gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:28.3317845Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:52:28.3318242Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:28.3318647Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.3319066Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:28.3319481Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3319930Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.3320377Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:52:28.3320769Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:52:28.3321186Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:52:28.3321646Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:28.3322081Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:52:28.3322492Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3322906Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:52:28.3323304Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:52:28.3323719Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3324140Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.3324530Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:52:28.3324948Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3325409Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.3325857Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:52:28.3326260Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:52:28.3326688Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:52:28.3327168Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 
2025-05-07T19:52:28.3327622Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:52:28.3328199Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3328615Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:52:28.3329031Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:52:28.3329447Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3329879Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.3330279Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:52:28.3330706Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.3331181Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.3331628Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.3332271Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3332646Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3333044Z 2025-05-07T19:52:28.3333244Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3333406Z 2025-05-07T19:52:28.3333486Z 2025-05-07T19:52:28.3333681Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3333842Z 2025-05-07T19:52:28.3333928Z 2025-05-07T19:52:28.3334125Z OTHER_SRCS: 2025-05-07T19:52:28.3334243Z 2025-05-07T19:52:28.3334322Z 2025-05-07T19:52:28.3334525Z CC_FLAGS: 2025-05-07T19:52:28.3334642Z 2025-05-07T19:52:28.3334721Z 2025-05-07T19:52:28.3334911Z NVCC_FLAGS: 2025-05-07T19:52:28.3335129Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3335406Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3335687Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3335999Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3336254Z 2025-05-07T19:52:28.3336453Z HIPCC_FLAGS: 2025-05-07T19:52:28.3336577Z 2025-05-07T19:52:28.3336737Z 2025-05-07T19:52:28.3336917Z INCLUDE_DIRS: 2025-05-07T19:52:28.3337170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3337486Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3337779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3338081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3338576Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3339368Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3340023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3340425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3340865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3341348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3341869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3342343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3342907Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3343422Z 2025-05-07T19:52:28.3343614Z Selected Source Files: 2025-05-07T19:52:28.3343912Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3344305Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:28.3344726Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:28.3345156Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:28.3345570Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:28.3345980Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:28.3346379Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:28.3346807Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.3347235Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.3347686Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 
2025-05-07T19:52:28.3348149Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.3348555Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3348931Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3349283Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:52:28.3349647Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:52:28.3349995Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:52:28.3350377Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:52:28.3350786Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:52:28.3351196Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:52:28.3351746Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:52:28.3352112Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:52:28.3352486Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:52:28.3352953Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:52:28.3353371Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.3353894Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:52:28.3354300Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.3354697Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:52:28.3355080Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.3355490Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3355889Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:52:28.3356272Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:52:28.3356667Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:52:28.3357119Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:28.3357542Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:52:28.3358009Z 
gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3358423Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:52:28.3358832Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:52:28.3359254Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3359649Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:52:28.3360068Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3360495Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:52:28.3360902Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:52:28.3361327Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:52:28.3361799Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:28.3362259Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:52:28.3362680Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3363107Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:52:28.3363524Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:52:28.3363960Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.3364356Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:52:28.3364788Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.3365254Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.3365701Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.3366062Z 2025-05-07T19:52:28.3366255Z HIPified Source Files: 2025-05-07T19:52:28.3366417Z 2025-05-07T19:52:28.3366488Z 2025-05-07T19:52:28.3366669Z Library Dependencies: 2025-05-07T19:52:28.3366889Z torch 2025-05-07T19:52:28.3367070Z torch_library 2025-05-07T19:52:28.3367506Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 
2025-05-07T19:52:28.3368111Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3368713Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3369522Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3370179Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3370572Z fbgemm_gpu_tbe_common 2025-05-07T19:52:28.3370933Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3371333Z 2025-05-07T19:52:28.3371513Z Output Library: 2025-05-07T19:52:28.3371744Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:52:28.3371997Z 2025-05-07T19:52:28.3372182Z Destination Directory: 2025-05-07T19:52:28.3372411Z fbgemm_gpu 2025-05-07T19:52:28.3372629Z ================================================================================ 2025-05-07T19:52:28.3372857Z 2025-05-07T19:52:28.3372869Z 2025-05-07T19:52:28.3372873Z 2025-05-07T19:52:28.3372981Z ================================================================================ 2025-05-07T19:52:28.3373411Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:52:28.3373872Z 2025-05-07T19:52:28.3374056Z CPU_SRCS: 2025-05-07T19:52:28.3374285Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3374776Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3375125Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:52:28.3375445Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:52:28.3375757Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:52:28.3376087Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:52:28.3376458Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:52:28.3376884Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:52:28.3377256Z gen_embedding_split_none_pt2_autograd.cpp 
2025-05-07T19:52:28.3377646Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:28.3378127Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:52:28.3378518Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3379007Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:28.3379557Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:28.3380110Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:28.3380600Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3381016Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3381416Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3381863Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3382301Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3382686Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3383080Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3383489Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3383964Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3384485Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3384946Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3385632Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3386401Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3386921Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3387547Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3388222Z 
gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3388890Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3389494Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3389925Z 2025-05-07T19:52:28.3390100Z GPU_SRCS: 2025-05-07T19:52:28.3390376Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3390839Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3391299Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3391708Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3392112Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3392607Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3393112Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3393658Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3394136Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3394641Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3395325Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3395859Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3396499Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3397200Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3397913Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3398535Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3399105Z 
gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3399512Z 2025-05-07T19:52:28.3399712Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3399858Z 2025-05-07T19:52:28.3399954Z 2025-05-07T19:52:28.3400152Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3400372Z 2025-05-07T19:52:28.3400471Z 2025-05-07T19:52:28.3400661Z OTHER_SRCS: 2025-05-07T19:52:28.3400801Z 2025-05-07T19:52:28.3400885Z 2025-05-07T19:52:28.3401081Z CC_FLAGS: 2025-05-07T19:52:28.3401223Z 2025-05-07T19:52:28.3401308Z 2025-05-07T19:52:28.3401500Z NVCC_FLAGS: 2025-05-07T19:52:28.3401749Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3402046Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3402337Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3402663Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3402922Z 2025-05-07T19:52:28.3403130Z HIPCC_FLAGS: 2025-05-07T19:52:28.3403259Z 2025-05-07T19:52:28.3403340Z 2025-05-07T19:52:28.3403546Z INCLUDE_DIRS: 2025-05-07T19:52:28.3403787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3404126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3404414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3404743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3405264Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3406072Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3406752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3407173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3407625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3408107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3408651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3409133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
2025-05-07T19:52:28.3409702Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3410231Z 2025-05-07T19:52:28.3410436Z Selected Source Files: 2025-05-07T19:52:28.3410733Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3411124Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3411517Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:52:28.3411977Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:52:28.3412329Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:52:28.3412687Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:52:28.3413082Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:52:28.3413534Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:52:28.3413923Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:52:28.3414353Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:28.3414791Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:52:28.3415226Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3415734Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:28.3416324Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:28.3416906Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:28.3417483Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3417933Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.3418345Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3418829Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3419287Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3419692Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 
2025-05-07T19:52:28.3420096Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3420503Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3420981Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3421504Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3423460Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3423953Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3424482Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3424994Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3425583Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3426251Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3426897Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3427498Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.3427988Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3428448Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3428909Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3429310Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3429719Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3430132Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3430620Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3431154Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3431635Z 
gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3432132Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3432747Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3433443Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3434103Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3434787Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3435458Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3436071Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3436607Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.3436977Z 2025-05-07T19:52:28.3437177Z HIPified Source Files: 2025-05-07T19:52:28.3437332Z 2025-05-07T19:52:28.3437405Z 2025-05-07T19:52:28.3437608Z Library Dependencies: 2025-05-07T19:52:28.3437832Z torch 2025-05-07T19:52:28.3438027Z torch_library 2025-05-07T19:52:28.3438459Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3439061Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3439676Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3440485Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3441248Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3441626Z fbgemm 2025-05-07T19:52:28.3441830Z fbgemm_gpu_config 2025-05-07T19:52:28.3442049Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:28.3442291Z fbgemm_gpu_tbe_common 2025-05-07T19:52:28.3442523Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:28.3442779Z 
fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:28.3443179Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3443576Z 2025-05-07T19:52:28.3443770Z Output Library: 2025-05-07T19:52:28.3444008Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:52:28.3444290Z 2025-05-07T19:52:28.3444480Z Destination Directory: 2025-05-07T19:52:28.3444719Z fbgemm_gpu 2025-05-07T19:52:28.3444944Z ================================================================================ 2025-05-07T19:52:28.3445369Z 2025-05-07T19:52:28.3445373Z 2025-05-07T19:52:28.3445377Z 2025-05-07T19:52:28.3445486Z ================================================================================ 2025-05-07T19:52:28.3445907Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:52:28.3446270Z 2025-05-07T19:52:28.3446449Z CPU_SRCS: 2025-05-07T19:52:28.3446755Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:52:28.3447186Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:52:28.3447522Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:52:28.3447893Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:28.3448248Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:52:28.3448573Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:52:28.3448901Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:52:28.3449232Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:52:28.3449623Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:52:28.3450052Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:52:28.3450435Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:52:28.3450835Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:28.3451266Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:52:28.3451668Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 
2025-05-07T19:52:28.3452158Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:28.3452729Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:28.3453283Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:28.3453787Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:52:28.3454191Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:52:28.3454556Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:52:28.3455063Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:52:28.3455350Z 2025-05-07T19:52:28.3455529Z GPU_SRCS: 2025-05-07T19:52:28.3455772Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:52:28.3456187Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:52:28.3456626Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:52:28.3457059Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:52:28.3457479Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:52:28.3457946Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:52:28.3458429Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:52:28.3458927Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3459455Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3460004Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3460519Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:52:28.3461057Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3461570Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3462020Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 
2025-05-07T19:52:28.3462439Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3462888Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3463339Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3463829Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3464338Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3464801Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:28.3465229Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3465757Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3466232Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:28.3466718Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3467241Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3467751Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3468307Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3468874Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3469417Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:28.3469929Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3470453Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3470908Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:52:28.3471293Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3471716Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3472131Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3472656Z 
gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3473328Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3473771Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:28.3474191Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3474626Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3475044Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:52:28.3475446Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3475873Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3476314Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3476777Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3477285Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3477732Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:52:28.3478155Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3478598Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3479022Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:52:28.3479423Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3479851Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3480289Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3480751Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3481248Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3481697Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:28.3482114Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3482623Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 
2025-05-07T19:52:28.3483062Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:52:28.3483499Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3483958Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3484434Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3484930Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3485468Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3486113Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:28.3486565Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3487050Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3487641Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:52:28.3488184Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3488748Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3489325Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3489918Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3490551Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3491141Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:52:28.3491685Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3492265Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3492815Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:52:28.3493350Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3493912Z 
gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3494486Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3495086Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3495706Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3496291Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:28.3496836Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3497415Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3497985Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:52:28.3498365Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3498768Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3499173Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3499609Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3500065Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3500485Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:52:28.3500866Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3501287Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3501758Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:52:28.3502303Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3502880Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3503459Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3504151Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3504784Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3505375Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:52:28.3505943Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3506527Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3506951Z 2025-05-07T19:52:28.3507121Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3507257Z 2025-05-07T19:52:28.3507325Z 2025-05-07T19:52:28.3507491Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3507813Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:52:28.3508264Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:52:28.3508761Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:52:28.3509102Z 2025-05-07T19:52:28.3509263Z OTHER_SRCS: 2025-05-07T19:52:28.3509370Z 2025-05-07T19:52:28.3509443Z 2025-05-07T19:52:28.3509598Z CC_FLAGS: 2025-05-07T19:52:28.3509709Z 2025-05-07T19:52:28.3509777Z 2025-05-07T19:52:28.3509936Z NVCC_FLAGS: 2025-05-07T19:52:28.3510149Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3510395Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3510658Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3510937Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3511166Z 2025-05-07T19:52:28.3511340Z HIPCC_FLAGS: 2025-05-07T19:52:28.3511451Z 2025-05-07T19:52:28.3511524Z 2025-05-07T19:52:28.3511695Z INCLUDE_DIRS: 2025-05-07T19:52:28.3511909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3512203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3512456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3512819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3513479Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3514285Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3514943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3515356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3515803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3516275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3516799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3517251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3517816Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3518327Z 2025-05-07T19:52:28.3518515Z Selected Source Files: 2025-05-07T19:52:28.3518882Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:52:28.3519315Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:52:28.3519669Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:52:28.3520038Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:28.3520407Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:52:28.3520732Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:52:28.3521068Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:52:28.3521418Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:52:28.3521810Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:52:28.3522251Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:52:28.3522632Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:52:28.3523044Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:28.3523477Z gen_embedding_backward_split_approx_sgd_cpu.cpp 
2025-05-07T19:52:28.3523896Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:28.3524411Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:28.3525055Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:28.3525722Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:28.3526196Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:52:28.3526584Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:52:28.3526925Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:52:28.3527270Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:52:28.3527597Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:52:28.3527985Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:52:28.3528409Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:52:28.3528816Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:52:28.3529280Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:52:28.3529718Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:52:28.3530191Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:52:28.3530660Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3531167Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3531700Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3532182Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:52:28.3532646Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3533123Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3533558Z 
gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:28.3533953Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3534388Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3534829Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3535287Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3535787Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3536231Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:28.3536651Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3537099Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3537548Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:28.3538007Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3538485Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3538986Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3539498Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3540045Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3540543Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:28.3541024Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3541526Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3541949Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:52:28.3542319Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3542708Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3543107Z 
gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3543528Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3543987Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3544408Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:28.3544839Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3545254Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3545631Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:52:28.3546007Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3546401Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3546806Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3547241Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3547707Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3548133Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:52:28.3548520Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3548996Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3549379Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:52:28.3566750Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3567198Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3567622Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3568058Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3568532Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3568955Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:28.3569345Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 
2025-05-07T19:52:28.3569763Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3570160Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:52:28.3570567Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3571004Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3571446Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3571914Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3572401Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3572851Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:28.3573259Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3573709Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3574164Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:52:28.3574660Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3575187Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3575712Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3576267Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3576846Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3577380Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:52:28.3577877Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3578419Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3578927Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:52:28.3579415Z 
gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3579936Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3580456Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3581010Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3581586Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3582264Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:28.3582783Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3583313Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3583777Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:52:28.3584155Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3584561Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3584958Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3585392Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3586214Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3586744Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:52:28.3587171Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3587624Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3588142Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:52:28.3588736Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3589368Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3590003Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3590664Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3591353Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3591989Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:52:28.3592690Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3593335Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3593791Z 2025-05-07T19:52:28.3593994Z HIPified Source Files: 2025-05-07T19:52:28.3594152Z 2025-05-07T19:52:28.3594229Z 2025-05-07T19:52:28.3594430Z Library Dependencies: 2025-05-07T19:52:28.3594652Z torch 2025-05-07T19:52:28.3594841Z torch_library 2025-05-07T19:52:28.3595282Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3595885Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3596487Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3597291Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3597950Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3598323Z fbgemm 2025-05-07T19:52:28.3598523Z fbgemm_gpu_config 2025-05-07T19:52:28.3598744Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:28.3598984Z fbgemm_gpu_tbe_common 2025-05-07T19:52:28.3599213Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:28.3599455Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:28.3599843Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3600250Z 2025-05-07T19:52:28.3600437Z Output Library: 2025-05-07T19:52:28.3600663Z fbgemm_gpu_tbe_training_backward 
2025-05-07T19:52:28.3600926Z 2025-05-07T19:52:28.3601111Z Destination Directory: 2025-05-07T19:52:28.3601340Z fbgemm_gpu 2025-05-07T19:52:28.3601565Z ================================================================================ 2025-05-07T19:52:28.3601804Z 2025-05-07T19:52:28.3601809Z 2025-05-07T19:52:28.3601813Z 2025-05-07T19:52:28.3601923Z ================================================================================ 2025-05-07T19:52:28.3602359Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:52:28.3602750Z 2025-05-07T19:52:28.3602932Z CPU_SRCS: 2025-05-07T19:52:28.3603042Z 2025-05-07T19:52:28.3603237Z 2025-05-07T19:52:28.3603421Z GPU_SRCS: 2025-05-07T19:52:28.3603722Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:52:28.3604252Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:52:28.3604807Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:52:28.3605444Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:52:28.3605950Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:52:28.3606479Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:52:28.3606991Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:52:28.3607498Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:28.3608136Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:28.3608664Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:52:28.3609194Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:28.3609751Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:28.3610150Z 2025-05-07T19:52:28.3610325Z CUDA_SPECIFIC_SRCS: 
2025-05-07T19:52:28.3610457Z 2025-05-07T19:52:28.3610527Z 2025-05-07T19:52:28.3610708Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3610832Z 2025-05-07T19:52:28.3610901Z 2025-05-07T19:52:28.3611070Z OTHER_SRCS: 2025-05-07T19:52:28.3611178Z 2025-05-07T19:52:28.3611248Z 2025-05-07T19:52:28.3611415Z CC_FLAGS: 2025-05-07T19:52:28.3611515Z 2025-05-07T19:52:28.3611591Z 2025-05-07T19:52:28.3611751Z NVCC_FLAGS: 2025-05-07T19:52:28.3611950Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3612198Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3612465Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3612729Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3612959Z 2025-05-07T19:52:28.3613128Z HIPCC_FLAGS: 2025-05-07T19:52:28.3613249Z 2025-05-07T19:52:28.3613319Z 2025-05-07T19:52:28.3613495Z INCLUDE_DIRS: 2025-05-07T19:52:28.3613702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3613984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3614233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3614513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3614957Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3615690Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3616288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3616670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3617070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3617506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3617990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3618408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3618933Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
2025-05-07T19:52:28.3619393Z 2025-05-07T19:52:28.3619574Z Selected Source Files: 2025-05-07T19:52:28.3619893Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:52:28.3620384Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:52:28.3620906Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:52:28.3621401Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:52:28.3621898Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:52:28.3622421Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:52:28.3622994Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:52:28.3623517Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:28.3624055Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:28.3624585Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:52:28.3625113Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:28.3625350Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:28.3625423Z 2025-05-07T19:52:28.3625510Z HIPified Source Files: 2025-05-07T19:52:28.3625514Z 2025-05-07T19:52:28.3625581Z 2025-05-07T19:52:28.3625673Z Library Dependencies: 2025-05-07T19:52:28.3625744Z torch 2025-05-07T19:52:28.3625820Z torch_library 2025-05-07T19:52:28.3626173Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3626335Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3626644Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3626982Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3627160Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3627255Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:28.3627456Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3627532Z 2025-05-07T19:52:28.3627614Z Output Library: 2025-05-07T19:52:28.3627713Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:52:28.3627787Z 2025-05-07T19:52:28.3627871Z Destination Directory: 2025-05-07T19:52:28.3627946Z fbgemm_gpu 2025-05-07T19:52:28.3628048Z ================================================================================ 2025-05-07T19:52:28.3628056Z 2025-05-07T19:52:28.3628060Z 2025-05-07T19:52:28.3628070Z 2025-05-07T19:52:28.3628172Z ================================================================================ 2025-05-07T19:52:28.3628361Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:52:28.3628432Z 2025-05-07T19:52:28.3628509Z CPU_SRCS: 2025-05-07T19:52:28.3628512Z 2025-05-07T19:52:28.3628580Z 2025-05-07T19:52:28.3628652Z GPU_SRCS: 2025-05-07T19:52:28.3628843Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3629018Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3629209Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3629388Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3629620Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3629860Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3630000Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3630158Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3630304Z 
gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3630461Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3630611Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3630761Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3630938Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3631153Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3631360Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3631533Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3631728Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3631935Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3632184Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3632394Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3632695Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3632878Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3633080Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3633476Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3633719Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3633987Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3634328Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3634586Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3634856Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3635131Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3635287Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3635459Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3635634Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3635795Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3635975Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3636163Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3636330Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3636514Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3636702Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3636864Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3637060Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3637250Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3637401Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3637583Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3637761Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3637920Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3638113Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3638301Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3638381Z 2025-05-07T19:52:28.3638469Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3638474Z 2025-05-07T19:52:28.3638557Z 2025-05-07T19:52:28.3638645Z HIP_SPECIFIC_SRCS: 
2025-05-07T19:52:28.3638649Z 2025-05-07T19:52:28.3638720Z 2025-05-07T19:52:28.3638811Z OTHER_SRCS: 2025-05-07T19:52:28.3638815Z 2025-05-07T19:52:28.3638889Z 2025-05-07T19:52:28.3638964Z CC_FLAGS: 2025-05-07T19:52:28.3638968Z 2025-05-07T19:52:28.3639050Z 2025-05-07T19:52:28.3639130Z NVCC_FLAGS: 2025-05-07T19:52:28.3639226Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3639322Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3639431Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3639525Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3639596Z 2025-05-07T19:52:28.3639683Z HIPCC_FLAGS: 2025-05-07T19:52:28.3639687Z 2025-05-07T19:52:28.3639761Z 2025-05-07T19:52:28.3639844Z INCLUDE_DIRS: 2025-05-07T19:52:28.3639952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3640058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3640169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3640273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3640623Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3641016Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3641159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3641319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3641483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3641686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3641887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3642037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3642348Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3642477Z 2025-05-07T19:52:28.3642575Z Selected Source Files: 2025-05-07T19:52:28.3642780Z 
gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3642972Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3643180Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3643379Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3643628Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3643885Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3644041Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3644201Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3644359Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3644533Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3644688Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.3644853Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.3645052Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3645269Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3645596Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3645769Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3645973Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3646167Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3646351Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3646571Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3646782Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 
2025-05-07T19:52:28.3646963Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3647175Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3647380Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3647606Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3647856Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3648108Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3648341Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3648592Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3648851Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3648992Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3649218Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3649385Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3649527Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3649693Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3649869Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3650011Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3650178Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3650344Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3650501Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3650679Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3650908Z 
gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3651055Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.3651222Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3651387Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3651541Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.3651712Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.3651887Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.3651958Z 2025-05-07T19:52:28.3652051Z HIPified Source Files: 2025-05-07T19:52:28.3652055Z 2025-05-07T19:52:28.3652123Z 2025-05-07T19:52:28.3652208Z Library Dependencies: 2025-05-07T19:52:28.3652285Z torch 2025-05-07T19:52:28.3652359Z torch_library 2025-05-07T19:52:28.3652650Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3652808Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3653126Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3653452Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3653626Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3653729Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:28.3653926Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3653996Z 2025-05-07T19:52:28.3654081Z Output Library: 2025-05-07T19:52:28.3654179Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:52:28.3654250Z 2025-05-07T19:52:28.3654336Z Destination Directory: 2025-05-07T19:52:28.3654415Z fbgemm_gpu 2025-05-07T19:52:28.3654519Z ================================================================================ 2025-05-07T19:52:28.3654523Z 
2025-05-07T19:52:28.3654653Z 2025-05-07T19:52:28.3654661Z 2025-05-07T19:52:28.3654759Z ================================================================================ 2025-05-07T19:52:28.3655121Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:52:28.3655191Z 2025-05-07T19:52:28.3655265Z CPU_SRCS: 2025-05-07T19:52:28.3655269Z 2025-05-07T19:52:28.3655343Z 2025-05-07T19:52:28.3655417Z GPU_SRCS: 2025-05-07T19:52:28.3655550Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:52:28.3655691Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:52:28.3655842Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3655997Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3656154Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3656321Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:28.3656499Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3656683Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3656826Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:52:28.3657419Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:52:28.3657578Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3657749Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3657852Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:52:28.3657922Z 2025-05-07T19:52:28.3658004Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3658008Z 2025-05-07T19:52:28.3658083Z 2025-05-07T19:52:28.3658161Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3658165Z 2025-05-07T19:52:28.3658231Z 2025-05-07T19:52:28.3658313Z OTHER_SRCS: 2025-05-07T19:52:28.3658317Z 2025-05-07T19:52:28.3658384Z 2025-05-07T19:52:28.3658458Z CC_FLAGS: 2025-05-07T19:52:28.3658462Z 2025-05-07T19:52:28.3658527Z 
2025-05-07T19:52:28.3658610Z NVCC_FLAGS: 2025-05-07T19:52:28.3658702Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3658843Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3658947Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3659038Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3659107Z 2025-05-07T19:52:28.3659180Z HIPCC_FLAGS: 2025-05-07T19:52:28.3659191Z 2025-05-07T19:52:28.3659259Z 2025-05-07T19:52:28.3659335Z INCLUDE_DIRS: 2025-05-07T19:52:28.3659438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3659534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3659629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3659726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3659997Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3660361Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3660492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3660642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3660797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3660987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3661173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3661310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3661599Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3661667Z 2025-05-07T19:52:28.3661760Z Selected Source Files: 2025-05-07T19:52:28.3661897Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:52:28.3662061Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:28.3662201Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:52:28.3662307Z 
gen_embedding_backward_split_dense.cpp 2025-05-07T19:52:28.3662435Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:52:28.3662588Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.3662746Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.3662909Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.3663085Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.3663272Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.3663411Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:52:28.3663569Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.3663733Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.3663807Z 2025-05-07T19:52:28.3663889Z HIPified Source Files: 2025-05-07T19:52:28.3663893Z 2025-05-07T19:52:28.3663960Z 2025-05-07T19:52:28.3664048Z Library Dependencies: 2025-05-07T19:52:28.3664117Z torch 2025-05-07T19:52:28.3664191Z torch_library 2025-05-07T19:52:28.3664477Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3664639Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3664995Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3665320Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3665500Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3665593Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:28.3665787Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3665864Z 2025-05-07T19:52:28.3665940Z Output Library: 2025-05-07T19:52:28.3666040Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:52:28.3666111Z 
2025-05-07T19:52:28.3666202Z Destination Directory: 2025-05-07T19:52:28.3666276Z fbgemm_gpu 2025-05-07T19:52:28.3666380Z ================================================================================ 2025-05-07T19:52:28.3666435Z 2025-05-07T19:52:28.3666438Z 2025-05-07T19:52:28.3666441Z 2025-05-07T19:52:28.3666549Z ================================================================================ 2025-05-07T19:52:28.3666761Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:52:28.3666830Z 2025-05-07T19:52:28.3666909Z CPU_SRCS: 2025-05-07T19:52:28.3666913Z 2025-05-07T19:52:28.3666980Z 2025-05-07T19:52:28.3667050Z GPU_SRCS: 2025-05-07T19:52:28.3667158Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:52:28.3667289Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:52:28.3667385Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:52:28.3667484Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:52:28.3667590Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:52:28.3667695Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:52:28.3667835Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:52:28.3667975Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:52:28.3668083Z gen_embedding_backward_split_none.cpp 2025-05-07T19:52:28.3668258Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:28.3668371Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:52:28.3668521Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:52:28.3668714Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:28.3668923Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:28.3669116Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:28.3669269Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:52:28.3669388Z 
gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:52:28.3669528Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:28.3669684Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:28.3669855Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:28.3670037Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:28.3670177Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:52:28.3670316Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:28.3670444Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:52:28.3670589Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:52:28.3670716Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:52:28.3670852Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:28.3670995Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:52:28.3671158Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:28.3671348Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:52:28.3671543Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:52:28.3671740Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:52:28.3671936Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:28.3672116Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:52:28.3672262Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:52:28.3672473Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:52:28.3672779Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:52:28.3672850Z 2025-05-07T19:52:28.3673108Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3673113Z 2025-05-07T19:52:28.3673185Z 2025-05-07T19:52:28.3673291Z 
HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3673295Z 2025-05-07T19:52:28.3673372Z 2025-05-07T19:52:28.3673465Z OTHER_SRCS: 2025-05-07T19:52:28.3673469Z 2025-05-07T19:52:28.3673544Z 2025-05-07T19:52:28.3673682Z CC_FLAGS: 2025-05-07T19:52:28.3673686Z 2025-05-07T19:52:28.3673773Z 2025-05-07T19:52:28.3673911Z NVCC_FLAGS: 2025-05-07T19:52:28.3674011Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3674114Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3674234Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3674332Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3674409Z 2025-05-07T19:52:28.3674498Z HIPCC_FLAGS: 2025-05-07T19:52:28.3674502Z 2025-05-07T19:52:28.3674575Z 2025-05-07T19:52:28.3674657Z INCLUDE_DIRS: 2025-05-07T19:52:28.3674770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3674874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3674981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3675088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3675380Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3675777Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3675929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3676105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3676261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3676466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3676669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3676824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3677135Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3677214Z 2025-05-07T19:52:28.3677316Z Selected Source Files: 2025-05-07T19:52:28.3677430Z 
gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:52:28.3677566Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:52:28.3677686Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:52:28.3677791Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:52:28.3677895Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:52:28.3678014Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:52:28.3678174Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:52:28.3678328Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:52:28.3678436Z gen_embedding_backward_split_none.cpp 2025-05-07T19:52:28.3678628Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:28.3678745Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:52:28.3678897Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:52:28.3679104Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:28.3679338Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:28.3679535Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:28.3679697Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:52:28.3679832Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:52:28.3679993Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:28.3680153Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:28.3680416Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:28.3680610Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:28.3680753Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:52:28.3680899Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:28.3681051Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:52:28.3681208Z 
gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:52:28.3681347Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:52:28.3681504Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:28.3681659Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:52:28.3681823Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:28.3682037Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:52:28.3682307Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:52:28.3682511Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:52:28.3682733Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:28.3682872Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:52:28.3683023Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:52:28.3683253Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:52:28.3683501Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:52:28.3683577Z 2025-05-07T19:52:28.3683670Z HIPified Source Files: 2025-05-07T19:52:28.3683675Z 2025-05-07T19:52:28.3683759Z 2025-05-07T19:52:28.3683852Z Library Dependencies: 2025-05-07T19:52:28.3683927Z torch 2025-05-07T19:52:28.3684011Z torch_library 2025-05-07T19:52:28.3684331Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3684498Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3684829Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3685191Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3685379Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 
2025-05-07T19:52:28.3685470Z fbgemm_gpu_config 2025-05-07T19:52:28.3685567Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:28.3685977Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3686053Z 2025-05-07T19:52:28.3686139Z Output Library: 2025-05-07T19:52:28.3686261Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:52:28.3686332Z 2025-05-07T19:52:28.3686425Z Destination Directory: 2025-05-07T19:52:28.3686513Z fbgemm_gpu 2025-05-07T19:52:28.3686626Z ================================================================================ 2025-05-07T19:52:28.3686631Z 2025-05-07T19:52:28.3686635Z 2025-05-07T19:52:28.3686639Z 2025-05-07T19:52:28.3686749Z ================================================================================ 2025-05-07T19:52:28.3686929Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:52:28.3687001Z 2025-05-07T19:52:28.3687076Z CPU_SRCS: 2025-05-07T19:52:28.3687284Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:52:28.3687479Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:52:28.3687552Z 2025-05-07T19:52:28.3687629Z GPU_SRCS: 2025-05-07T19:52:28.3687829Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:52:28.3687965Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:52:28.3688087Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:52:28.3688223Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:52:28.3688378Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:52:28.3688511Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:52:28.3688751Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:52:28.3688900Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:52:28.3688975Z 2025-05-07T19:52:28.3689066Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3689070Z 2025-05-07T19:52:28.3689160Z 2025-05-07T19:52:28.3689247Z 
HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3689251Z 2025-05-07T19:52:28.3689328Z 2025-05-07T19:52:28.3689407Z OTHER_SRCS: 2025-05-07T19:52:28.3689411Z 2025-05-07T19:52:28.3689498Z 2025-05-07T19:52:28.3689580Z CC_FLAGS: 2025-05-07T19:52:28.3689584Z 2025-05-07T19:52:28.3689659Z 2025-05-07T19:52:28.3689751Z NVCC_FLAGS: 2025-05-07T19:52:28.3689848Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3689947Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3690050Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3690265Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3690340Z 2025-05-07T19:52:28.3690421Z HIPCC_FLAGS: 2025-05-07T19:52:28.3690426Z 2025-05-07T19:52:28.3690514Z 2025-05-07T19:52:28.3690601Z INCLUDE_DIRS: 2025-05-07T19:52:28.3690710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3690808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3690922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3691025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3691312Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3691718Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3691860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3692020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3692187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3692391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3692595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3692749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3693068Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3693144Z 2025-05-07T19:52:28.3693236Z Selected Source Files: 2025-05-07T19:52:28.3693455Z 
codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:52:28.3693646Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:52:28.3693839Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:52:28.3693980Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:52:28.3694102Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:52:28.3694238Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:52:28.3694380Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:52:28.3694525Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:52:28.3694660Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:52:28.3694798Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:52:28.3694884Z 2025-05-07T19:52:28.3694976Z HIPified Source Files: 2025-05-07T19:52:28.3694980Z 2025-05-07T19:52:28.3695054Z 2025-05-07T19:52:28.3695148Z Library Dependencies: 2025-05-07T19:52:28.3695233Z torch 2025-05-07T19:52:28.3695312Z torch_library 2025-05-07T19:52:28.3695622Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3695798Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3696135Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3696487Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3696683Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3696791Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:28.3696877Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:28.3697144Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3697344Z 2025-05-07T19:52:28.3697429Z Output Library: 2025-05-07T19:52:28.3697637Z fbgemm_gpu_tbe_index_select 2025-05-07T19:52:28.3697718Z 
2025-05-07T19:52:28.3697801Z Destination Directory: 2025-05-07T19:52:28.3697872Z fbgemm_gpu 2025-05-07T19:52:28.3697977Z ================================================================================ 2025-05-07T19:52:28.3697988Z 2025-05-07T19:52:28.3697992Z 2025-05-07T19:52:28.3697995Z 2025-05-07T19:52:28.3698093Z ================================================================================ 2025-05-07T19:52:28.3698271Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:52:28.3698338Z 2025-05-07T19:52:28.3698419Z CPU_SRCS: 2025-05-07T19:52:28.3698581Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:52:28.3698711Z 2025-05-07T19:52:28.3698783Z GPU_SRCS: 2025-05-07T19:52:28.3698950Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:52:28.3699092Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:52:28.3699159Z 2025-05-07T19:52:28.3699247Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3699251Z 2025-05-07T19:52:28.3699315Z 2025-05-07T19:52:28.3699392Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3699396Z 2025-05-07T19:52:28.3699470Z 2025-05-07T19:52:28.3699544Z OTHER_SRCS: 2025-05-07T19:52:28.3699548Z 2025-05-07T19:52:28.3699615Z 2025-05-07T19:52:28.3699683Z CC_FLAGS: 2025-05-07T19:52:28.3699687Z 2025-05-07T19:52:28.3699760Z 2025-05-07T19:52:28.3699831Z NVCC_FLAGS: 2025-05-07T19:52:28.3699919Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3700015Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3700107Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3700194Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3700263Z 2025-05-07T19:52:28.3700340Z HIPCC_FLAGS: 2025-05-07T19:52:28.3700344Z 2025-05-07T19:52:28.3700409Z 2025-05-07T19:52:28.3700481Z INCLUDE_DIRS: 2025-05-07T19:52:28.3700587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3700672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3700770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3700863Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3701127Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3701487Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3701616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3701769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3701914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3702101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3702295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3702429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3702715Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3702782Z 2025-05-07T19:52:28.3702867Z Selected Source Files: 2025-05-07T19:52:28.3703027Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:52:28.3703184Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:52:28.3703332Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:52:28.3703400Z 2025-05-07T19:52:28.3703482Z HIPified Source Files: 2025-05-07T19:52:28.3703487Z 2025-05-07T19:52:28.3703555Z 2025-05-07T19:52:28.3703636Z Library Dependencies: 2025-05-07T19:52:28.3703705Z torch 2025-05-07T19:52:28.3703780Z torch_library 2025-05-07T19:52:28.3704070Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3704227Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3704578Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3704909Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3705083Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3705276Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3705348Z 2025-05-07T19:52:28.3705426Z Output Library: 2025-05-07T19:52:28.3705520Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:52:28.3705586Z 2025-05-07T19:52:28.3705674Z Destination Directory: 2025-05-07T19:52:28.3705746Z fbgemm_gpu 2025-05-07T19:52:28.3705848Z ================================================================================ 2025-05-07T19:52:28.3705852Z 2025-05-07T19:52:28.3705856Z 2025-05-07T19:52:28.3705859Z 2025-05-07T19:52:28.3706043Z ================================================================================ 2025-05-07T19:52:28.3706160Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:52:28.3706231Z 2025-05-07T19:52:28.3706305Z CPU_SRCS: 2025-05-07T19:52:28.3706402Z src/memory_utils/memory_utils.cpp 2025-05-07T19:52:28.3706496Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:52:28.3706681Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:28.3706882Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:52:28.3707073Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:52:28.3707274Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:52:28.3707477Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:28.3707696Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:52:28.3707832Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:52:28.3707961Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:52:28.3708086Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:52:28.3708196Z 
src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:52:28.3708333Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:52:28.3708438Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:52:28.3708534Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:52:28.3708650Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:52:28.3708748Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:52:28.3708840Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:52:28.3708925Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:52:28.3709007Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:52:28.3709105Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:52:28.3709195Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:52:28.3709287Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:52:28.3709382Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:52:28.3709604Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:52:28.3709746Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:52:28.3709944Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:28.3710169Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:52:28.3710266Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:52:28.3710356Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:52:28.3710452Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:52:28.3710560Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:52:28.3710746Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:28.3710839Z src/topology_utils.cpp 2025-05-07T19:52:28.3710909Z 2025-05-07T19:52:28.3710981Z GPU_SRCS: 2025-05-07T19:52:28.3711087Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:52:28.3711191Z src/input_combine_ops/input_combine.cu 2025-05-07T19:52:28.3711392Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:52:28.3711485Z 
src/memory_utils/memory_utils.cu 2025-05-07T19:52:28.3711638Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:52:28.3711823Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:52:28.3711998Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:52:28.3712124Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:52:28.3712256Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:52:28.3712501Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:52:28.3712760Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:52:28.3713114Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:52:28.3713262Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:52:28.3713418Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:52:28.3713627Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:52:28.3713759Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:52:28.3713894Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:52:28.3714010Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:52:28.3714182Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:52:28.3714336Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:52:28.3714464Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:52:28.3714628Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:52:28.3714763Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:52:28.3714869Z src/metric_ops/metric_ops.cu 2025-05-07T19:52:28.3715094Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:52:28.3715301Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:52:28.3715486Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:52:28.3715597Z 
src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:52:28.3715720Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:52:28.3715847Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:52:28.3715974Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:52:28.3716081Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:52:28.3716179Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:52:28.3716302Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:52:28.3716401Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:52:28.3716528Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:52:28.3716661Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:52:28.3716778Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:52:28.3716916Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:52:28.3717057Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:52:28.3717201Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:52:28.3717307Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:52:28.3717410Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:52:28.3717514Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:52:28.3717620Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:52:28.3717754Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:52:28.3717879Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:52:28.3717981Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:52:28.3718086Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:52:28.3718186Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:52:28.3718301Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:52:28.3718396Z src/sparse_ops/sparse_range.cu 2025-05-07T19:52:28.3718515Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:52:28.3718626Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:52:28.3718721Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:52:28.3718801Z 
2025-05-07T19:52:28.3718885Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:28.3718890Z 2025-05-07T19:52:28.3718961Z 2025-05-07T19:52:28.3719045Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:28.3719103Z 2025-05-07T19:52:28.3719180Z 2025-05-07T19:52:28.3719259Z OTHER_SRCS: 2025-05-07T19:52:28.3719263Z 2025-05-07T19:52:28.3719334Z 2025-05-07T19:52:28.3719414Z CC_FLAGS: 2025-05-07T19:52:28.3719418Z 2025-05-07T19:52:28.3719488Z 2025-05-07T19:52:28.3719567Z NVCC_FLAGS: 2025-05-07T19:52:28.3719661Z --expt-relaxed-constexpr 2025-05-07T19:52:28.3719758Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:28.3719855Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:28.3719946Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:28.3720023Z 2025-05-07T19:52:28.3720102Z HIPCC_FLAGS: 2025-05-07T19:52:28.3720106Z 2025-05-07T19:52:28.3720177Z 2025-05-07T19:52:28.3720256Z INCLUDE_DIRS: 2025-05-07T19:52:28.3720367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3720461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:28.3720615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:28.3720721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:28.3721007Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 2025-05-07T19:52:28.3721399Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:28.3721547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:28.3721706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:28.3721860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:28.3722062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:28.3722266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:28.3722408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:28.3722717Z 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 2025-05-07T19:52:28.3722803Z 2025-05-07T19:52:28.3722892Z Selected Source Files: 2025-05-07T19:52:28.3722991Z src/memory_utils/memory_utils.cpp 2025-05-07T19:52:28.3723105Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:52:28.3723305Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:28.3723517Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:52:28.3723729Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:52:28.3723952Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:52:28.3724169Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:28.3724407Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:52:28.3724565Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:52:28.3724698Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:52:28.3724827Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:52:28.3724955Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:52:28.3725104Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:52:28.3725325Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:52:28.3725424Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:52:28.3725554Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:52:28.3725649Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:52:28.3725743Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:52:28.3725833Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:52:28.3725918Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:52:28.3726015Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:52:28.3726106Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:52:28.3726210Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:52:28.3726303Z src/tbe/eeg/indices_generator.cpp 
2025-05-07T19:52:28.3726525Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:52:28.3726675Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:52:28.3726884Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:28.3727157Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:52:28.3727260Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:52:28.3727351Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:52:28.3727447Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:52:28.3727561Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:52:28.3727752Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:28.3727835Z src/topology_utils.cpp 2025-05-07T19:52:28.3727941Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:52:28.3728049Z src/input_combine_ops/input_combine.cu 2025-05-07T19:52:28.3728248Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:52:28.3728342Z src/memory_utils/memory_utils.cu 2025-05-07T19:52:28.3728440Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:52:28.3728679Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:52:28.3728857Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:52:28.3728979Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:52:28.3729111Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:52:28.3729347Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:52:28.3729518Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:52:28.3729690Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:52:28.3729827Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:52:28.3729972Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 
2025-05-07T19:52:28.3730100Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:52:28.3730233Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:52:28.3730352Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:52:28.3730465Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:52:28.3730627Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:52:28.3730769Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:52:28.3730884Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:52:28.3731032Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:52:28.3731157Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:52:28.3731251Z src/metric_ops/metric_ops.cu 2025-05-07T19:52:28.3731459Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:52:28.3731646Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:52:28.3731820Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:52:28.3731920Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:52:28.3732029Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:52:28.3732149Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:52:28.3732269Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:52:28.3732366Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:52:28.3732465Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:52:28.3732583Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:52:28.3732674Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:52:28.3732795Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:52:28.3732920Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:52:28.3733032Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:52:28.3733167Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:52:28.3733297Z 
src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:52:28.3733428Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:52:28.3733526Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:52:28.3733625Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:52:28.3733723Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:52:28.3733828Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:52:28.3733955Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:52:28.3734128Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:52:28.3734229Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:52:28.3734325Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:52:28.3734433Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:52:28.3734544Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:52:28.3734639Z src/sparse_ops/sparse_range.cu 2025-05-07T19:52:28.3734757Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:52:28.3734863Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:52:28.3734954Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:52:28.3735024Z 2025-05-07T19:52:28.3735116Z HIPified Source Files: 2025-05-07T19:52:28.3735120Z 2025-05-07T19:52:28.3735191Z 2025-05-07T19:52:28.3735278Z Library Dependencies: 2025-05-07T19:52:28.3735359Z torch 2025-05-07T19:52:28.3735501Z torch_library 2025-05-07T19:52:28.3735791Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so 2025-05-07T19:52:28.3735959Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:28.3736270Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:28.3736597Z /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:28.3736772Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:28.3736855Z fbgemm 2025-05-07T19:52:28.3736954Z fbgemm_gpu_sparse_async_cumsum 
2025-05-07T19:52:28.3737051Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:52:28.3737151Z fbgemm_gpu_tbe_index_select 2025-05-07T19:52:28.3737232Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:28.3737321Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:52:28.3737398Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:28.3737606Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:28.3737679Z 2025-05-07T19:52:28.3737762Z Output Library: 2025-05-07T19:52:28.3737848Z fbgemm_gpu_py 2025-05-07T19:52:28.3737920Z 2025-05-07T19:52:28.3738009Z Destination Directory: 2025-05-07T19:52:28.3738083Z fbgemm_gpu 2025-05-07T19:52:28.3738197Z ================================================================================ 2025-05-07T19:52:28.3738201Z 2025-05-07T19:52:28.3738290Z -- Configuring done (8.5s) 2025-05-07T19:52:28.4881303Z -- Generating done (0.1s) 2025-05-07T19:52:28.4904180Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build 2025-05-07T19:52:28.5076913Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build' 2025-05-07T19:52:28.5076934Z 2025-05-07T19:52:28.5077239Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:52:28.6114836Z [1/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:52:28.6129410Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.6327755Z [2/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:52:28.6339460Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:52:28.6350257Z [3/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:52:28.6360884Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.6413024Z [4/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:52:28.6423784Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.6447517Z [5/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:52:28.6458974Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.6552913Z [6/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:52:28.6563119Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.6573381Z [7/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:52:28.6584547Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.6630168Z [8/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:52:28.6641329Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.6652351Z [9/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:52:28.6663598Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.6795353Z [10/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:52:28.6806988Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.6956108Z [11/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:52:28.6967011Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7152960Z [12/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:52:28.7164054Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7276140Z [13/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:52:28.7287689Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7318798Z [14/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:52:28.7329706Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7340122Z [15/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:52:28.7350993Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7414353Z [16/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:52:28.7425428Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7483277Z [17/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:52:28.7494760Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7563925Z [18/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:52:28.7574881Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7669993Z [19/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:52:28.7680578Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7821195Z [20/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:52:28.7832380Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.7922190Z [21/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:52:28.7933740Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:52:28.8089993Z [22/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:52:28.8101426Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8260763Z [23/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:52:28.8271965Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8320169Z [24/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:52:28.8331720Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8343532Z [25/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:52:28.8355183Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8385642Z [26/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:52:28.8397169Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8514614Z [27/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:52:28.8520731Z clang-16: warning: 
argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8526621Z [28/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:52:28.8532687Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8538749Z [29/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:52:28.8544711Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8660724Z [30/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:52:28.8671863Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8684173Z [31/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:52:28.8695074Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8822044Z [32/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:52:28.8833409Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.8951482Z [33/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:52:28.8962509Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.9029809Z [34/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:52:28.9040901Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.9051703Z [35/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:52:28.9062489Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.9228090Z [36/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:52:28.9239542Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.9365298Z [37/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:52:28.9376636Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.9497881Z [38/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:52:28.9509111Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.9633083Z [39/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:52:28.9644689Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.9654900Z [40/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:52:28.9666064Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:28.9825257Z [41/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:52:28.9837055Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.0072206Z [42/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:52:29.0083668Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.0212130Z [43/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:52:29.0222958Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.0565856Z [44/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:52:29.0576862Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.0588317Z [45/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:52:29.0600154Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.1392292Z [46/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:52:29.1403799Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.1966040Z [47/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:52:29.1976885Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.2236750Z [48/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:52:29.2248859Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.2270255Z [49/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:52:29.2282079Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.2400968Z [50/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:52:29.2412772Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.2451706Z [51/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:52:29.2463670Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.2570828Z [52/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:52:29.2582727Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.3302397Z [53/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:52:29.3314132Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.3907730Z [54/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:52:29.3919308Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.4768164Z [55/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:52:29.4785298Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6138374Z [56/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:52:29.6155599Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6168636Z [57/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:52:29.6179122Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6321992Z [58/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:52:29.6332650Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7774318Z [59/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:52:29.7791188Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8438724Z [60/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:52:29.8449859Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.9533342Z [61/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:52:29.9545524Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.2457684Z [62/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:52:30.2469879Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.9004967Z [63/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:52:30.9021612Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.9348772Z [64/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:52:31.1906148Z 
[65/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:52:31.1922899Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:52:34.5597517Z [66/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:52:34.5613828Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:34.8149448Z [67/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c 
/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:52:34.8170888Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:36.3001841Z [68/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:52:36.3834954Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:36.3851115Z [69/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG 
-std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:52:36.3867834Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:36.4252710Z [70/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:52:36.4268992Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:36.5427809Z [71/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:52:36.5445588Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:36.5746513Z [72/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:52:36.5762489Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:37.3236423Z [73/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:52:37.3254862Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:37.6306461Z [74/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:52:37.6320461Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:38.5536785Z [75/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:38.5546868Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:39.1583142Z [76/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:52:39.7733368Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:39.7753290Z [77/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:52:40.0723823Z [78/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 
-O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:52:40.0733509Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:40.6826969Z [79/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:40.6844530Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:42.7978522Z [80/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:42.7997070Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:42.9816756Z [81/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:42.9834902Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:44.3275625Z [82/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:44.3293597Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:47.0722765Z [83/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:47.0740494Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:52:47.1151931Z [84/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:47.1169816Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:47.3047905Z [85/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:47.3065113Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:49.2537925Z [86/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:49.2567354Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:50.7177559Z [87/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:50.7194799Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:54.1983118Z [88/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:52:54.1996465Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:55.5964583Z [89/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:55.5983003Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:56.4629576Z [90/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:56.4648374Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:57.6955254Z [91/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:57.6973323Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:52:59.3677450Z [92/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:59.3694733Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:00.9889699Z [93/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:53:00.9906813Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:02.3936526Z [94/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:53:02.3955579Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:03.8029521Z [95/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:53:03.8048218Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:04.8073676Z [96/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:53:06.3751938Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:06.3770223Z [97/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:53:06.3789470Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:53:06.9842433Z [98/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:53:06.9864411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.9866371Z 2025-05-07T19:53:06.9868074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.9870002Z 2025-05-07T19:53:06.9871706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.9873790Z 2025-05-07T19:53:06.9875474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.9877435Z 2025-05-07T19:53:06.9879139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.9881051Z 2025-05-07T19:53:06.9882594Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:06.9883993Z 2025-05-07T19:53:07.5374648Z [99/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:53:07.5395547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.5397366Z 2025-05-07T19:53:07.5399089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.5400627Z 2025-05-07T19:53:07.5401832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu(13): warning #177-D: variable "::TORCH_LIBRARY_FRAGMENT_static_init_fbgemm_2" was declared but never referenced 2025-05-07T19:53:07.5403202Z 2025-05-07T19:53:07.5404741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T19:53:07.5406514Z 2025-05-07T19:53:07.5408077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.5409701Z 2025-05-07T19:53:07.5411067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu(13): warning #177-D: variable "::TORCH_LIBRARY_FRAGMENT_static_init_fbgemm_2" was declared but never referenced 2025-05-07T19:53:07.5412643Z 2025-05-07T19:53:07.5414169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.5415966Z 2025-05-07T19:53:07.5417473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.5419053Z 2025-05-07T19:53:07.6563209Z [100/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:53:07.6581640Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:07.7703909Z [101/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:53:07.7725841Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.7727751Z 2025-05-07T19:53:07.7729251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.7731457Z 2025-05-07T19:53:07.7733042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.7734862Z 2025-05-07T19:53:07.7736533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.7738437Z 2025-05-07T19:53:07.7739993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.7741803Z 2025-05-07T19:53:07.7743481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:07.7745371Z 2025-05-07T19:53:08.0735704Z [102/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:53:08.0755816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.0757505Z 2025-05-07T19:53:08.0758973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.0761189Z 2025-05-07T19:53:08.0762798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.0764676Z 2025-05-07T19:53:08.0766349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.0768243Z 2025-05-07T19:53:08.0769953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.0771789Z 2025-05-07T19:53:08.0773414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.0775366Z 2025-05-07T19:53:08.1673226Z [103/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ 
-MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:53:08.1694518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.1696243Z 2025-05-07T19:53:08.1697640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.1699689Z 2025-05-07T19:53:08.1701470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.1703024Z 2025-05-07T19:53:08.1704521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.1706192Z 2025-05-07T19:53:08.1707793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.1709528Z 2025-05-07T19:53:08.1711091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:53:08.1713178Z 2025-05-07T19:53:08.5164063Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:53:08.5185584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.5187801Z 2025-05-07T19:53:08.5189601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.5191833Z 2025-05-07T19:53:08.5193615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.5195518Z 2025-05-07T19:53:08.5197255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.5199190Z 2025-05-07T19:53:08.5200867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.5202797Z 2025-05-07T19:53:08.5204433Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.5206341Z 2025-05-07T19:53:08.9149893Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:53:08.9170223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9172285Z 2025-05-07T19:53:08.9173785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9175495Z 2025-05-07T19:53:08.9176929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9178413Z 2025-05-07T19:53:08.9179750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9181039Z 2025-05-07T19:53:08.9182586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9184299Z 2025-05-07T19:53:08.9186215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9188065Z 2025-05-07T19:53:08.9808761Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:53:08.9827709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9829711Z 2025-05-07T19:53:08.9831115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9832885Z 2025-05-07T19:53:08.9834119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9835582Z 2025-05-07T19:53:08.9836834Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9838286Z 2025-05-07T19:53:08.9839714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9841000Z 2025-05-07T19:53:08.9842429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9843715Z 2025-05-07T19:53:09.3000908Z [107/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:53:09.3018887Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:10.6472185Z [108/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:53:10.6492290Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:11.7345422Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:53:12.0044641Z [110/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:53:12.0062271Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:12.8271155Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:53:12.8287548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.8289232Z 2025-05-07T19:53:12.8290576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.8292069Z 2025-05-07T19:53:12.8293398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.8294823Z 2025-05-07T19:53:12.8296031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.8297364Z 2025-05-07T19:53:12.8298700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.8300277Z 2025-05-07T19:53:12.8301656Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:12.8303206Z 2025-05-07T19:53:18.4505077Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:53:18.4527166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.4528765Z 2025-05-07T19:53:18.4530199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.4531861Z 2025-05-07T19:53:18.4533425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.4535264Z 2025-05-07T19:53:18.4536777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:53:18.4538476Z 2025-05-07T19:53:18.4540042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.4541824Z 2025-05-07T19:53:18.4543442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:18.4545246Z 2025-05-07T19:53:19.7045518Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:53:19.7068238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7070238Z 2025-05-07T19:53:19.7072020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7074114Z 2025-05-07T19:53:19.7075739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7077218Z 2025-05-07T19:53:19.7078472Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7079787Z 2025-05-07T19:53:19.7081313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7083119Z 2025-05-07T19:53:19.7084378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.7086341Z 2025-05-07T19:53:20.4390038Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:53:20.4410680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4412736Z 2025-05-07T19:53:20.4414508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4416445Z 2025-05-07T19:53:20.4418115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4420040Z 2025-05-07T19:53:20.4421804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4423761Z 2025-05-07T19:53:20.4425359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4427292Z 2025-05-07T19:53:20.4429042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.4431024Z 2025-05-07T19:53:20.4824472Z [115/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:53:20.8363143Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:20.8382019Z [116/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:53:20.8400273Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:24.1787624Z [117/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:53:24.1806275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.1807765Z 2025-05-07T19:53:24.1809066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.1810553Z 2025-05-07T19:53:24.1811847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.1813352Z 2025-05-07T19:53:24.1814667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.1816145Z 2025-05-07T19:53:24.1817464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.1818930Z 2025-05-07T19:53:24.1820226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:24.1821828Z 2025-05-07T19:53:25.6676296Z [118/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:53:25.6697437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6699347Z 2025-05-07T19:53:25.6701045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6703073Z 2025-05-07T19:53:25.6704779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6706535Z 2025-05-07T19:53:25.6708034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6709634Z 2025-05-07T19:53:25.6711111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6712879Z 2025-05-07T19:53:25.6714358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:25.6716546Z 2025-05-07T19:53:26.7035913Z [119/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:53:26.7056834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.7058682Z 2025-05-07T19:53:26.7060280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.7061979Z 2025-05-07T19:53:26.7063186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.7064618Z 2025-05-07T19:53:26.7066194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.7067974Z 2025-05-07T19:53:26.7069560Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.7071340Z 2025-05-07T19:53:26.7073443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.7075240Z 2025-05-07T19:53:27.6317558Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:27.6339715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6341378Z 2025-05-07T19:53:27.6342768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6344371Z 2025-05-07T19:53:27.6345950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6347815Z 
2025-05-07T19:53:27.6349463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6351331Z 2025-05-07T19:53:27.6353521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6355491Z 2025-05-07T19:53:27.6357117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.6359096Z 2025-05-07T19:53:28.1735685Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:53:28.1758970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.1760898Z 2025-05-07T19:53:28.1762588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:28.1764434Z 2025-05-07T19:53:28.1766033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.1767882Z 2025-05-07T19:53:28.1769885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.1771765Z 2025-05-07T19:53:28.1773363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.1775184Z 2025-05-07T19:53:28.1776869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.1778708Z 2025-05-07T19:53:29.5718723Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:53:29.5737758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.5739426Z 2025-05-07T19:53:29.5740702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.5742426Z 2025-05-07T19:53:29.5743910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.5745632Z 2025-05-07T19:53:29.5747038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.5748998Z 2025-05-07T19:53:29.5750264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.5751810Z 2025-05-07T19:53:29.5753563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.5754849Z 2025-05-07T19:53:38.3099546Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:53:38.3121237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T19:53:38.3122926Z 2025-05-07T19:53:38.3124459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:38.3126285Z 2025-05-07T19:53:38.3127878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:38.3129661Z 2025-05-07T19:53:38.3131719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:38.3133403Z 2025-05-07T19:53:38.3134952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:38.3136711Z 2025-05-07T19:53:38.3138226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:38.3140162Z 2025-05-07T19:53:38.9900245Z [124/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined 
-Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:45.8392400Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:53:45.8415953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:45.8417882Z 2025-05-07T19:53:45.8419634Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:45.8421559Z 2025-05-07T19:53:45.8423243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:45.8425169Z 2025-05-07T19:53:45.8426851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:45.8428814Z 2025-05-07T19:53:45.8430510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:45.8432859Z 2025-05-07T19:53:45.8434602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:45.8436522Z 2025-05-07T19:53:46.4513298Z [126/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib 
-Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static 
-ldl && : 2025-05-07T19:53:47.2780497Z [127/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:53:47.2805131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.2807077Z 2025-05-07T19:53:47.2808827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.2810672Z 2025-05-07T19:53:47.2812317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.2814181Z 2025-05-07T19:53:47.2815834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.2817782Z 2025-05-07T19:53:47.2819476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.2821386Z 2025-05-07T19:53:47.2823023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored 
on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:47.2824864Z 2025-05-07T19:53:48.8225100Z [128/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:53:48.8241664Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:48.9372643Z [129/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda 
-O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:48.9393932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:48.9395690Z 2025-05-07T19:53:48.9397255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:48.9398971Z 2025-05-07T19:53:48.9400766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:48.9402528Z 2025-05-07T19:53:48.9404028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:48.9405752Z 2025-05-07T19:53:48.9407279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:48.9408973Z 2025-05-07T19:53:48.9410499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:48.9412457Z 2025-05-07T19:53:49.5572200Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:49.5595201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5597039Z 2025-05-07T19:53:49.5598955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5600813Z 2025-05-07T19:53:49.5602332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5604103Z 
2025-05-07T19:53:49.5605665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5607366Z 2025-05-07T19:53:49.5608867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5610842Z 2025-05-07T19:53:49.5612437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5614215Z 2025-05-07T19:53:49.5615757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5617514Z 2025-05-07T19:53:49.5619072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5620811Z 2025-05-07T19:53:49.5622354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5624078Z 2025-05-07T19:53:49.5625463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:53:49.5627104Z 2025-05-07T19:53:49.5628673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5630473Z 2025-05-07T19:53:49.5632092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:49.5633991Z 2025-05-07T19:53:49.5635509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5637252Z 2025-05-07T19:53:49.5638792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5640544Z 2025-05-07T19:53:49.5642060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:49.5643780Z 2025-05-07T19:53:49.5797059Z [131/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib 
-Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so && : 2025-05-07T19:53:50.2854506Z [132/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 
fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T19:53:52.8100322Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:52.8122963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.8124875Z 2025-05-07T19:53:52.8126835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.8128778Z 2025-05-07T19:53:52.8130365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8131790Z 2025-05-07T19:53:52.8133095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8134777Z 2025-05-07T19:53:52.8136277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8138281Z 2025-05-07T19:53:52.8139747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8141413Z 2025-05-07T19:53:52.8143035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.8144884Z 2025-05-07T19:53:52.8146502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.8148353Z 2025-05-07T19:53:52.8149866Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8151603Z 2025-05-07T19:53:52.8153301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8155068Z 2025-05-07T19:53:52.8156332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8157822Z 2025-05-07T19:53:52.8159333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8161075Z 2025-05-07T19:53:52.8162679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.8164494Z 2025-05-07T19:53:52.8166077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:52.8167929Z 2025-05-07T19:53:52.8169454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8171157Z 2025-05-07T19:53:52.8172683Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8174418Z 2025-05-07T19:53:52.8176113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8177917Z 2025-05-07T19:53:52.8179492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:52.8180891Z 2025-05-07T19:53:53.2825459Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:53.2848476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.2850326Z 2025-05-07T19:53:53.2851978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:53:53.2853867Z 2025-05-07T19:53:53.2855424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2857220Z 2025-05-07T19:53:53.2859047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2860863Z 2025-05-07T19:53:53.2862514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2864124Z 2025-05-07T19:53:53.2865435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2867027Z 2025-05-07T19:53:53.2868637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.2870661Z 2025-05-07T19:53:53.2872247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.2874199Z 2025-05-07T19:53:53.2875716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:53:53.2877482Z 2025-05-07T19:53:53.2879029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2880795Z 2025-05-07T19:53:53.2882352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2884024Z 2025-05-07T19:53:53.2885595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2887688Z 2025-05-07T19:53:53.2889050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.2890692Z 2025-05-07T19:53:53.2892276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.2894102Z 2025-05-07T19:53:53.2895645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2897331Z 2025-05-07T19:53:53.2898818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:53:53.2900579Z 2025-05-07T19:53:53.2902110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2903873Z 2025-05-07T19:53:53.2905752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.2907490Z 2025-05-07T19:53:59.8574998Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:59.8597106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:59.8598882Z 2025-05-07T19:53:59.8600512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:59.8602431Z 2025-05-07T19:53:59.8604158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:59.8606108Z 
2025-05-07T19:53:59.8607886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:59.8609930Z 2025-05-07T19:53:59.8611497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:59.8613231Z 2025-05-07T19:53:59.8615132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:59.8616765Z 2025-05-07T19:54:00.5941064Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:54:00.5962940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5964827Z 2025-05-07T19:54:00.5966568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5968449Z 2025-05-07T19:54:00.5969834Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5971327Z 2025-05-07T19:54:00.5972843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5974448Z 2025-05-07T19:54:00.5976258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5977955Z 2025-05-07T19:54:00.5979401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5981107Z 2025-05-07T19:54:01.1506901Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:54:01.1529137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.1530802Z 2025-05-07T19:54:01.1532149Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.1533679Z 2025-05-07T19:54:01.1535189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.1536824Z 2025-05-07T19:54:01.1538396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.1540295Z 2025-05-07T19:54:01.1542288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.1544209Z 2025-05-07T19:54:01.1545915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.1547745Z 2025-05-07T19:54:02.9272812Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 
2025-05-07T19:54:02.9294298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.9296141Z 2025-05-07T19:54:02.9297731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.9299528Z 2025-05-07T19:54:02.9301147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.9303067Z 2025-05-07T19:54:02.9305095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.9306921Z 2025-05-07T19:54:02.9308579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.9310470Z 2025-05-07T19:54:02.9312150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:02.9314138Z 2025-05-07T19:54:06.5734237Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o 
CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:54:06.5754946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:06.5756869Z 2025-05-07T19:54:06.5758564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:06.5760367Z 2025-05-07T19:54:06.5761738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:06.5763422Z 2025-05-07T19:54:06.5765290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:06.5766959Z 2025-05-07T19:54:06.5768562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:06.5770386Z 2025-05-07T19:54:06.5772016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:06.5773927Z 2025-05-07T19:54:07.2473247Z [140/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 
/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:54:07.9020002Z [141/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:54:09.1948184Z [142/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:54:09.1971377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.1973345Z 2025-05-07T19:54:09.1975046Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.1977011Z 2025-05-07T19:54:09.1978677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.1980995Z 2025-05-07T19:54:09.1982638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.1984523Z 2025-05-07T19:54:09.1986522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.1988435Z 2025-05-07T19:54:09.1990043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.1991961Z 2025-05-07T19:54:16.4588578Z [143/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 
2025-05-07T19:54:16.4609701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.4611586Z 2025-05-07T19:54:16.4613060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.4615374Z 2025-05-07T19:54:16.4616969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.4618743Z 2025-05-07T19:54:16.4620468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.4622386Z 2025-05-07T19:54:16.4624031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.4625885Z 2025-05-07T19:54:16.4627367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.4628974Z 2025-05-07T19:54:16.8554319Z [144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:16.8575193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.8576845Z 2025-05-07T19:54:16.8578297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.8580463Z 2025-05-07T19:54:16.8581972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.8583746Z 2025-05-07T19:54:16.8585289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.8587413Z 2025-05-07T19:54:16.8588812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.8590361Z 2025-05-07T19:54:16.8591775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:16.8593651Z 2025-05-07T19:54:28.0613061Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:28.0634844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.0636924Z 2025-05-07T19:54:28.0638463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.0640144Z 2025-05-07T19:54:28.0641649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.0643462Z 2025-05-07T19:54:28.0645116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.0646978Z 2025-05-07T19:54:28.0648622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.0650030Z 2025-05-07T19:54:28.0651495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.0653251Z 2025-05-07T19:54:28.2084550Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:54:28.2105768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.2107987Z 2025-05-07T19:54:28.2109576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.2111263Z 2025-05-07T19:54:28.2112917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.2114590Z 2025-05-07T19:54:28.2115951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.2117474Z 2025-05-07T19:54:28.2119022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.2120894Z 2025-05-07T19:54:28.2122357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.2123969Z 2025-05-07T19:54:36.8371099Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:36.8391832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.8393605Z 2025-05-07T19:54:36.8395343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.8397196Z 
2025-05-07T19:54:36.8398836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:36.8400602Z 2025-05-07T19:54:36.8401959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:36.8403573Z 2025-05-07T19:54:36.8405046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:36.8406750Z 2025-05-07T19:54:36.8408289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.8410028Z 2025-05-07T19:54:36.8411465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.8413469Z 2025-05-07T19:54:36.8414990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:36.8416504Z 2025-05-07T19:54:36.8417893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:54:36.8419443Z 2025-05-07T19:54:36.8420835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:36.8422520Z 2025-05-07T19:54:36.8427958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.8429746Z 2025-05-07T19:54:36.8431007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.8433019Z 2025-05-07T19:54:36.8434310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:36.8435885Z 2025-05-07T19:54:36.8437323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:36.8439350Z 2025-05-07T19:54:36.8440819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:36.8442433Z 2025-05-07T19:54:38.0062055Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:38.0083629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.0085400Z 2025-05-07T19:54:38.0087026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.0088603Z 2025-05-07T19:54:38.0090156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.0091908Z 2025-05-07T19:54:38.0093493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.0095688Z 2025-05-07T19:54:38.0097165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.0098882Z 2025-05-07T19:54:38.0100406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:38.0102039Z 2025-05-07T19:54:38.1351825Z [149/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:54:38.1369749Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:39.3068144Z [150/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:54:39.3094731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3096700Z 2025-05-07T19:54:39.3098405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3100313Z 2025-05-07T19:54:39.3101962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3103974Z 2025-05-07T19:54:39.3105760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3107676Z 2025-05-07T19:54:39.3109332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3111221Z 2025-05-07T19:54:39.3113083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.3114912Z 2025-05-07T19:54:41.7394050Z [151/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:41.7413821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.7415613Z 2025-05-07T19:54:41.7417108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.7418881Z 2025-05-07T19:54:41.7420068Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7424184Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7427243Z (955): here 2025-05-07T19:54:41.7427426Z 2025-05-07T19:54:41.7428545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7433061Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7436132Z (1007): here 2025-05-07T19:54:41.7436342Z 2025-05-07T19:54:41.7437425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7441359Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7444610Z (1059): here 2025-05-07T19:54:41.7444825Z 2025-05-07T19:54:41.7445976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7450090Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7453214Z (1111): here 2025-05-07T19:54:41.7453427Z 2025-05-07T19:54:41.7454518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7458352Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7461269Z (1163): here 2025-05-07T19:54:41.7461481Z 2025-05-07T19:54:41.7462566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7466538Z detected during instantiation of "void 
split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7469717Z (1215): here 2025-05-07T19:54:41.7469914Z 2025-05-07T19:54:41.7471030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7479795Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7483141Z (1267): here 2025-05-07T19:54:41.7483376Z 2025-05-07T19:54:41.7484496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7488750Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7491981Z (1319): here 2025-05-07T19:54:41.7492201Z 2025-05-07T19:54:41.7493276Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7497304Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7500590Z (1371): here 2025-05-07T19:54:41.7500805Z 2025-05-07T19:54:41.7502004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7506088Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7509457Z (1423): here 2025-05-07T19:54:41.7509690Z 2025-05-07T19:54:41.7511048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7515618Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7518700Z (1475): here 2025-05-07T19:54:41.7518922Z 2025-05-07T19:54:41.7520297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7525160Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7528409Z (1527): here 2025-05-07T19:54:41.7528649Z 2025-05-07T19:54:41.7529933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7533946Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7537246Z (1579): here 2025-05-07T19:54:41.7537457Z 2025-05-07T19:54:41.7538616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7542605Z detected during instantiation of "void 
split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7545696Z (1631): here 2025-05-07T19:54:41.7545945Z 2025-05-07T19:54:41.7547214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7551649Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7554932Z (1683): here 2025-05-07T19:54:41.7555127Z 2025-05-07T19:54:41.7556305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7560726Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7564043Z (1735): here 2025-05-07T19:54:41.7564268Z 2025-05-07T19:54:41.7565453Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7570248Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7573330Z (1787): here 2025-05-07T19:54:41.7573542Z 2025-05-07T19:54:41.7574768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7579099Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7582367Z (1839): here 2025-05-07T19:54:41.7582569Z 2025-05-07T19:54:41.7583687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7588213Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7591390Z (1891): here 2025-05-07T19:54:41.7591591Z 2025-05-07T19:54:41.7592901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7597004Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7600372Z (1943): here 2025-05-07T19:54:41.7600604Z 2025-05-07T19:54:41.7601961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7606313Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7609704Z (1995): here 2025-05-07T19:54:41.7609938Z 2025-05-07T19:54:41.7611181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7615637Z detected during 
instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7618777Z (2047): here 2025-05-07T19:54:41.7618971Z 2025-05-07T19:54:41.7620151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7624357Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7627712Z (2099): here 2025-05-07T19:54:41.7627916Z 2025-05-07T19:54:41.7629066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7633334Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7636477Z (2151): here 2025-05-07T19:54:41.7636685Z 
2025-05-07T19:54:41.7638186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.7639917Z 2025-05-07T19:54:41.7641534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.7643357Z 2025-05-07T19:54:41.7644568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7648765Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7651777Z (955): here 2025-05-07T19:54:41.7651987Z 2025-05-07T19:54:41.7653185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7657387Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 
2025-05-07T19:54:41.7660496Z (1007): here 2025-05-07T19:54:41.7660722Z 2025-05-07T19:54:41.7662228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7666457Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7669641Z (1059): here 2025-05-07T19:54:41.7669875Z 2025-05-07T19:54:41.7670947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7675538Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7678588Z (1111): here 2025-05-07T19:54:41.7678795Z 2025-05-07T19:54:41.7679975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7683957Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7687242Z (1163): here 2025-05-07T19:54:41.7687445Z 2025-05-07T19:54:41.7688542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7692595Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7695631Z (1215): here 2025-05-07T19:54:41.7695831Z 2025-05-07T19:54:41.7696956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7700924Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7704029Z (1267): here 2025-05-07T19:54:41.7704259Z 2025-05-07T19:54:41.7705387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a 
change of sign 2025-05-07T19:54:41.7709945Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7713161Z (1319): here 2025-05-07T19:54:41.7713399Z 2025-05-07T19:54:41.7714537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7718516Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7722154Z (1371): here 2025-05-07T19:54:41.7722386Z 2025-05-07T19:54:41.7723675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7727914Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7731171Z 
(1423): here 2025-05-07T19:54:41.7731389Z 2025-05-07T19:54:41.7732500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7736731Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7739793Z (1475): here 2025-05-07T19:54:41.7739997Z 2025-05-07T19:54:41.7741172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7745228Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7748140Z (1527): here 2025-05-07T19:54:41.7748354Z 2025-05-07T19:54:41.7749539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7753942Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7757164Z (1579): here 2025-05-07T19:54:41.7757365Z 2025-05-07T19:54:41.7758758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7763456Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7766659Z (1631): here 2025-05-07T19:54:41.7766874Z 2025-05-07T19:54:41.7768062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7772284Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7775639Z (1683): here 2025-05-07T19:54:41.7775859Z 2025-05-07T19:54:41.7777047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in 
a change of sign 2025-05-07T19:54:41.7781389Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7784454Z (1735): here 2025-05-07T19:54:41.7784671Z 2025-05-07T19:54:41.7786080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7790383Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7793693Z (1787): here 2025-05-07T19:54:41.7793899Z 2025-05-07T19:54:41.7795022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7799512Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 
2025-05-07T19:54:41.7802646Z (1839): here 2025-05-07T19:54:41.7802868Z 2025-05-07T19:54:41.7804019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7808348Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7811966Z (1891): here 2025-05-07T19:54:41.7812189Z 2025-05-07T19:54:41.7813439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7817758Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7821080Z (1943): here 2025-05-07T19:54:41.7821296Z 2025-05-07T19:54:41.7822436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7826901Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, 
__nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7830188Z (1995): here 2025-05-07T19:54:41.7830434Z 2025-05-07T19:54:41.7831654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7835846Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7839119Z (2047): here 2025-05-07T19:54:41.7839322Z 2025-05-07T19:54:41.7840464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7844836Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7848176Z (2099): here 2025-05-07T19:54:41.7848383Z 2025-05-07T19:54:41.7849615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning 
#68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7853773Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7857205Z (2151): here 2025-05-07T19:54:41.7857425Z 2025-05-07T19:54:41.7859036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.7860874Z 2025-05-07T19:54:41.7862519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:41.7864186Z 2025-05-07T19:54:41.7865338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7869540Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7872870Z (955): here 2025-05-07T19:54:41.7873074Z 2025-05-07T19:54:41.7874209Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7878566Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7881616Z (1007): here 2025-05-07T19:54:41.7881841Z 2025-05-07T19:54:41.7883062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7888023Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7891429Z (1059): here 2025-05-07T19:54:41.7891679Z 2025-05-07T19:54:41.7892914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7897601Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, 
const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7900896Z (1111): here 2025-05-07T19:54:41.7901163Z 2025-05-07T19:54:41.7902436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7906703Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7909964Z (1163): here 2025-05-07T19:54:41.7910181Z 2025-05-07T19:54:41.7911469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7916051Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7919126Z (1215): here 2025-05-07T19:54:41.7919331Z 2025-05-07T19:54:41.7920579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7924874Z detected during instantiation of "void 
split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7927922Z (1267): here 2025-05-07T19:54:41.7928147Z 2025-05-07T19:54:41.7929381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7933566Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7936444Z (1319): here 2025-05-07T19:54:41.7936665Z 2025-05-07T19:54:41.7938281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7942487Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7945686Z (1371): here 2025-05-07T19:54:41.7945892Z 2025-05-07T19:54:41.7947032Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7951681Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7954911Z (1423): here 2025-05-07T19:54:41.7955114Z 2025-05-07T19:54:41.7956242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7960388Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7963557Z (1475): here 2025-05-07T19:54:41.7963761Z 2025-05-07T19:54:41.7964922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7969082Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t 
*, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7972156Z (1527): here 2025-05-07T19:54:41.7972363Z 2025-05-07T19:54:41.7973422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7977438Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7980533Z (1579): here 2025-05-07T19:54:41.7980736Z 2025-05-07T19:54:41.7982186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7986534Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7989648Z (1631): here 2025-05-07T19:54:41.7989864Z 2025-05-07T19:54:41.7991062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.7995633Z detected during instantiation of "void 
split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.7998811Z (1683): here 2025-05-07T19:54:41.7999031Z 2025-05-07T19:54:41.8000313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.8004189Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.8007316Z (1735): here 2025-05-07T19:54:41.8007530Z 2025-05-07T19:54:41.8008696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.8013023Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.8016276Z (1787): here 2025-05-07T19:54:41.8016506Z 2025-05-07T19:54:41.8017637Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.8021691Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.8024756Z (1839): here 2025-05-07T19:54:41.8024958Z 2025-05-07T19:54:41.8026523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.8030680Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.8033898Z (1891): here 2025-05-07T19:54:41.8034082Z 2025-05-07T19:54:41.8035217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.8042492Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.8045632Z (1943): here 2025-05-07T19:54:41.8045863Z 2025-05-07T19:54:41.8047081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.8051555Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.8054824Z (1995): here 2025-05-07T19:54:41.8055062Z 2025-05-07T19:54:41.8056249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.8060647Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.8063858Z (2047): here 2025-05-07T19:54:41.8064075Z 2025-05-07T19:54:41.8065249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.8069506Z detected during instantiation of "void 
split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.8072784Z (2099): here 2025-05-07T19:54:41.8072999Z 2025-05-07T19:54:41.8074150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:41.8078573Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:41.8081606Z (2151): here 2025-05-07T19:54:41.8081824Z 2025-05-07T19:54:42.0332955Z [152/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:42.0354413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.0356268Z 2025-05-07T19:54:42.0357793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.0359452Z 2025-05-07T19:54:42.0360839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.0362579Z 2025-05-07T19:54:42.0364533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.0366379Z 2025-05-07T19:54:42.0367827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.0369413Z 2025-05-07T19:54:42.0370872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.0372530Z 2025-05-07T19:54:43.3563513Z [153/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:54:43.3585651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.3587754Z 2025-05-07T19:54:43.3602033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.3603917Z 2025-05-07T19:54:43.3605938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:43.3607681Z 2025-05-07T19:54:43.3609126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:43.3610913Z 2025-05-07T19:54:43.3612393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:43.3614102Z 2025-05-07T19:54:43.3615690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:43.3617730Z 2025-05-07T19:54:43.3619337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.3621125Z 2025-05-07T19:54:43.3622598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:43.3624376Z 2025-05-07T19:54:43.3625759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:43.3627390Z 2025-05-07T19:54:43.3628956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:43.3630518Z 2025-05-07T19:54:43.3632089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.3633976Z 2025-05-07T19:54:43.3635467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.3637250Z 2025-05-07T19:54:43.3638738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of 
unsigned integer with zero 2025-05-07T19:54:43.3640399Z 2025-05-07T19:54:43.3641951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:43.3643608Z 2025-05-07T19:54:43.3645116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:43.3646859Z 2025-05-07T19:54:43.4103847Z [154/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:43.4125147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.4126966Z 2025-05-07T19:54:43.4128500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.4130299Z 2025-05-07T19:54:43.4131823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:54:43.4133630Z 2025-05-07T19:54:43.4135187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.4136982Z 2025-05-07T19:54:43.4138508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.4140259Z 2025-05-07T19:54:43.4141851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.4143617Z 2025-05-07T19:54:45.5973496Z [155/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:54:45.5997922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.5999914Z 2025-05-07T19:54:45.6001604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6003483Z 2025-05-07T19:54:45.6005052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6006807Z 2025-05-07T19:54:45.6008482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6010294Z 2025-05-07T19:54:45.6011886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6013618Z 2025-05-07T19:54:45.6015148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6016982Z 2025-05-07T19:54:45.6019018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6020911Z 2025-05-07T19:54:45.6022611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6024799Z 2025-05-07T19:54:45.6026254Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6028098Z 2025-05-07T19:54:45.6029690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6031808Z 2025-05-07T19:54:45.6033629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6035376Z 2025-05-07T19:54:45.6036960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6038692Z 2025-05-07T19:54:45.6040434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6042450Z 2025-05-07T19:54:45.6044414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6046314Z 2025-05-07T19:54:45.6047887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6049698Z 2025-05-07T19:54:45.6051253Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6052980Z 2025-05-07T19:54:45.6054480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6056252Z 2025-05-07T19:54:45.6057776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6059507Z 2025-05-07T19:54:45.6660430Z [156/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:54:45.6685113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6687268Z 2025-05-07T19:54:45.6688914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:54:45.6690771Z 2025-05-07T19:54:45.6692360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6694243Z 2025-05-07T19:54:45.6695889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6697723Z 2025-05-07T19:54:45.6699261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6701067Z 2025-05-07T19:54:45.6702618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6704425Z 2025-05-07T19:54:45.6706120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6708047Z 2025-05-07T19:54:45.6709743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6711708Z 2025-05-07T19:54:45.6713893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:54:45.6715670Z 2025-05-07T19:54:45.6717307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6719043Z 2025-05-07T19:54:45.6720637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6722450Z 2025-05-07T19:54:45.6724321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6726162Z 2025-05-07T19:54:45.6727871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6729763Z 2025-05-07T19:54:45.6731506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.6733469Z 2025-05-07T19:54:45.6735087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6736940Z 2025-05-07T19:54:45.6738606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:54:45.6740380Z 2025-05-07T19:54:45.6742083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6744008Z 2025-05-07T19:54:45.6745693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.6747622Z 2025-05-07T19:54:48.1823727Z [157/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:48.1844956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.1846769Z 2025-05-07T19:54:48.1848575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.1850331Z 2025-05-07T19:54:48.1851756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.1853663Z 
2025-05-07T19:54:48.1855401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.1857315Z 2025-05-07T19:54:48.1858998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.1860845Z 2025-05-07T19:54:48.1862518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:48.1864205Z 2025-05-07T19:54:49.1901079Z [158/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:49.1924189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.1926132Z 2025-05-07T19:54:49.1927851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.1929734Z 2025-05-07T19:54:49.1931465Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.1933307Z 2025-05-07T19:54:49.1935265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.1937220Z 2025-05-07T19:54:49.1938917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.1940751Z 2025-05-07T19:54:49.1942142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.1943909Z 2025-05-07T19:54:51.1502862Z [159/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:51.1526543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.1528246Z 
2025-05-07T19:54:51.1529780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.1531666Z 2025-05-07T19:54:51.1533391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.1535305Z 2025-05-07T19:54:51.1536922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.1538637Z 2025-05-07T19:54:51.1540109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.1542014Z 2025-05-07T19:54:51.1543708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.1545649Z 2025-05-07T19:54:51.9082298Z [160/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:51.9105854Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9107858Z 2025-05-07T19:54:51.9109375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9111162Z 2025-05-07T19:54:51.9112942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9114585Z 2025-05-07T19:54:51.9115899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9117465Z 2025-05-07T19:54:51.9119022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9120723Z 2025-05-07T19:54:51.9122150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9123722Z 2025-05-07T19:54:54.4317227Z [161/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:54.4335991Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:54:54.8095883Z [162/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:54:54.8112409Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.8114184Z [163/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:54:55.8134787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.8136517Z 2025-05-07T19:54:55.8137984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.8139728Z 2025-05-07T19:54:55.8141165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8142794Z 2025-05-07T19:54:55.8144246Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8145915Z 2025-05-07T19:54:55.8147369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8149064Z 2025-05-07T19:54:55.8150576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8152357Z 2025-05-07T19:54:55.8154149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.8155867Z 2025-05-07T19:54:55.8157265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.8158999Z 2025-05-07T19:54:55.8160172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8161692Z 2025-05-07T19:54:55.8162786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8164538Z 2025-05-07T19:54:55.8166043Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8167501Z 2025-05-07T19:54:55.8168851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8170259Z 2025-05-07T19:54:55.8171694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.8173350Z 2025-05-07T19:54:55.8174829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.8176372Z 2025-05-07T19:54:55.8177489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8178948Z 2025-05-07T19:54:55.8180155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8181687Z 2025-05-07T19:54:55.8183108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8184663Z 2025-05-07T19:54:55.8186255Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:55.8187812Z 2025-05-07T19:54:56.0920232Z [164/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:56.0940507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.0942244Z 2025-05-07T19:54:56.0943692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.0945318Z 2025-05-07T19:54:56.0946805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.0948378Z 2025-05-07T19:54:56.0949577Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.0951183Z 2025-05-07T19:54:56.0952708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.0954397Z 2025-05-07T19:54:56.0955834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.0957453Z 2025-05-07T19:54:56.1215985Z [165/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.1233433Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.1514338Z [166/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.1531613Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.1811285Z [167/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.1830037Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:54:56.2107125Z [168/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.2124216Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.2406113Z [169/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 
-DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.2423813Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.2699192Z [170/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.2717210Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.2997218Z [171/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.3015152Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.3290120Z [172/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.3307397Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.3584194Z [173/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.3602085Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.3880664Z [174/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.3899325Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.4175372Z [175/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.4193389Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.4470555Z [176/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.4488139Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.4768214Z [177/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:56.4786695Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.5894486Z [178/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:56.5911730Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:56.8711213Z [179/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:56.8732329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.8734022Z 2025-05-07T19:54:56.8735149Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.8736484Z 2025-05-07T19:54:56.8737812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.8739724Z 2025-05-07T19:54:56.8741429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.8743052Z 2025-05-07T19:54:56.8744476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.8746086Z 2025-05-07T19:54:56.8747805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.8749465Z 2025-05-07T19:54:56.8858974Z [180/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:56.8876739Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.6987064Z [181/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:54:57.7007748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.7009536Z 2025-05-07T19:54:57.7011197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.7012768Z 2025-05-07T19:54:57.7014281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7015769Z 2025-05-07T19:54:57.7016882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7018707Z 2025-05-07T19:54:57.7019711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7021036Z 2025-05-07T19:54:57.7022633Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7024113Z 2025-05-07T19:54:57.7025582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.7027332Z 2025-05-07T19:54:57.7029001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.7030586Z 2025-05-07T19:54:57.7031823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7033364Z 2025-05-07T19:54:57.7034582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7035913Z 2025-05-07T19:54:57.7037276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7038964Z 2025-05-07T19:54:57.7040510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7042345Z 2025-05-07T19:54:57.7044079Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.7045824Z 2025-05-07T19:54:57.7047563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.7050520Z 2025-05-07T19:54:57.7051908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7053358Z 2025-05-07T19:54:57.7054616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7056081Z 2025-05-07T19:54:57.7057390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7058949Z 2025-05-07T19:54:57.7060336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.7061938Z 2025-05-07T19:54:59.1783686Z [182/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:59.1802471Z clang-16: warning: argument unused during 
compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:59.6407642Z [183/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:59.6427592Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.3589710Z [184/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:55:00.3609405Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.6668379Z [185/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:55:00.6687673Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.8472135Z [186/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:55:00.8489240Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.5245356Z [187/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:55:01.5263618Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.5539727Z [188/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.5555838Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.5836311Z [189/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.5857216Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.6130112Z [190/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.6151243Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.6427587Z [191/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.6448412Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.6864880Z [192/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:55:01.6885040Z clang-16: warning: argument unused during 
compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.6906215Z [193/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.6927280Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.7166964Z [194/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.7187780Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.1401228Z [195/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:55:02.1422229Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.8286084Z [196/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 
2025-05-07T19:55:02.8306643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8308429Z 2025-05-07T19:55:02.8310075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8311818Z 2025-05-07T19:55:02.8313544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8315351Z 2025-05-07T19:55:02.8316953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8318628Z 2025-05-07T19:55:02.8320191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8321951Z 2025-05-07T19:55:02.8323724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8325536Z 2025-05-07T19:55:02.8867373Z [197/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:02.8890668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8892485Z 2025-05-07T19:55:02.8894122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8896031Z 2025-05-07T19:55:02.8897651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8899285Z 2025-05-07T19:55:02.8900787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8902442Z 2025-05-07T19:55:02.8904045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8905736Z 2025-05-07T19:55:02.8907264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8908945Z 2025-05-07T19:55:02.8910542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8912421Z 2025-05-07T19:55:02.8914133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8915876Z 2025-05-07T19:55:02.8917428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8919235Z 2025-05-07T19:55:02.8921155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8922817Z 2025-05-07T19:55:02.8924353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8926160Z 2025-05-07T19:55:02.8927756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.8929488Z 2025-05-07T19:55:02.8930972Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8932953Z 2025-05-07T19:55:02.8934397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8936177Z 2025-05-07T19:55:02.8937693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:02.8939449Z 2025-05-07T19:55:05.1389718Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:55:05.1412234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.1413909Z 2025-05-07T19:55:05.1415540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.1417217Z 
2025-05-07T19:55:05.1418855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.1420646Z 2025-05-07T19:55:05.1422395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.1424153Z 2025-05-07T19:55:05.1425757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.1427467Z 2025-05-07T19:55:05.1429049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.1430906Z 2025-05-07T19:55:06.0841558Z [199/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:55:06.0859901Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:06.4451797Z [200/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:55:06.4476777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:06.4478877Z 2025-05-07T19:55:06.4480709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:06.4482624Z 2025-05-07T19:55:06.4484454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:06.4486803Z 2025-05-07T19:55:06.4488633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:06.4490696Z 2025-05-07T19:55:06.4492487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:06.4494719Z 2025-05-07T19:55:06.4496823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:06.4498887Z 
2025-05-07T19:55:06.5149501Z [201/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:55:06.5169147Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:06.8942211Z [202/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:55:06.8956079Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:08.1066773Z [203/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:55:08.1085540Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.9071338Z [204/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:55:09.9090490Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:11.7028843Z [205/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:11.7048176Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:13.4715487Z [206/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:13.4734950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.4736689Z 2025-05-07T19:55:13.4738015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.4739752Z 2025-05-07T19:55:13.4741255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.4742880Z 2025-05-07T19:55:13.4744397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.4746128Z 2025-05-07T19:55:13.4747640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.4749374Z 2025-05-07T19:55:13.4751002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:55:13.4753060Z 2025-05-07T19:55:13.5608803Z [207/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:13.5626350Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:14.1518085Z [208/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:55:14.1537851Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:14.2838636Z [209/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:55:14.2852717Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:14.6238658Z [210/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:14.6255508Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:15.5344875Z [211/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:15.5361921Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:15.5962325Z [212/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 
2025-05-07T19:55:15.5981267Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:15.7596968Z [213/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:55:15.7617652Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:16.8102185Z [214/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:55:16.8121324Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.0791297Z [215/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:17.0812180Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.3376326Z [216/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:17.3394180Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.4251383Z [217/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:55:17.4270245Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.5289363Z [218/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:55:17.5307854Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.6922615Z [219/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:55:17.6940741Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.7539358Z [220/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:17.7561698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7563366Z 2025-05-07T19:55:17.7565195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7567071Z 2025-05-07T19:55:17.7568544Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7570133Z 2025-05-07T19:55:17.7571635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7573205Z 2025-05-07T19:55:17.7574457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7576262Z 2025-05-07T19:55:17.7577687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7579291Z 2025-05-07T19:55:18.0007366Z [221/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:18.0023773Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:18.0948337Z [222/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:55:18.0967507Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:18.4929893Z [223/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:55:18.4946212Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:19.0195469Z [224/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:55:19.0214823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:19.0216209Z 2025-05-07T19:55:19.0217390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:19.0218738Z 2025-05-07T19:55:19.0219881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:19.0221127Z 2025-05-07T19:55:19.0222348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:19.0223688Z 2025-05-07T19:55:19.0225147Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:19.0226546Z 2025-05-07T19:55:19.0227693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:19.0229008Z 2025-05-07T19:55:19.0230233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:19.0231503Z 2025-05-07T19:55:19.0232778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:19.0234307Z 2025-05-07T19:55:19.0235445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:19.0236676Z 2025-05-07T19:55:19.0237717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:19.0238928Z 2025-05-07T19:55:19.0240014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:19.0241248Z 
2025-05-07T19:55:19.0242388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:19.0243632Z 2025-05-07T19:55:19.0244682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:19.0245894Z 2025-05-07T19:55:19.0246943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:19.0248137Z 2025-05-07T19:55:19.0249221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:19.0250432Z 2025-05-07T19:55:19.7977406Z [225/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:55:19.7993566Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:20.2250803Z [226/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:55:20.2266233Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:21.2024953Z [227/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:21.2042845Z clang-16: warning: argument unused during 
compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:21.7753817Z [228/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:21.7772161Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:22.7032769Z [229/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:23.2435988Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:23.2456364Z [230/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:23.2476644Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:23.8322167Z [231/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:23.8340218Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:25.0327188Z [232/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:25.0344871Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:25.4804446Z [233/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 
2025-05-07T19:55:25.4822981Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:25.5676034Z [234/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:25.5695914Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:25.7384784Z [235/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:25.7403673Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:26.1771198Z [236/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:26.1792714Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:26.9815561Z [237/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:26.9835617Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:27.5355430Z [238/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:27.5377231Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:27.8109324Z [239/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T19:55:28.1792910Z [240/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:28.1813308Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:28.7950658Z [241/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:28.7969170Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:29.0705033Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:55:29.0720905Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0722199Z 2025-05-07T19:55:29.0723303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0724559Z 2025-05-07T19:55:29.0725572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0726705Z 2025-05-07T19:55:29.0727734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0728939Z 2025-05-07T19:55:29.0730002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0731254Z 2025-05-07T19:55:29.0732317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0733530Z 2025-05-07T19:55:29.0734683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0735981Z 2025-05-07T19:55:29.0737152Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0738486Z 2025-05-07T19:55:29.0739787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0741035Z 2025-05-07T19:55:29.0742092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0743277Z 2025-05-07T19:55:29.0744350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0745534Z 2025-05-07T19:55:29.0746548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0747851Z 2025-05-07T19:55:29.0748933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0750174Z 2025-05-07T19:55:29.0751259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0752631Z 2025-05-07T19:55:29.0753654Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0754781Z 2025-05-07T19:55:29.0755790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0756936Z 2025-05-07T19:55:29.0757925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0759070Z 2025-05-07T19:55:29.0760058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0761175Z 2025-05-07T19:55:29.0923910Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:55:29.0943184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0944712Z 
2025-05-07T19:55:29.0946062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0947646Z 2025-05-07T19:55:29.0948896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0950441Z 2025-05-07T19:55:29.0951734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0953339Z 2025-05-07T19:55:29.0954637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0956061Z 2025-05-07T19:55:29.0957318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0958794Z 2025-05-07T19:55:29.0960144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0961699Z 2025-05-07T19:55:29.0963020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0964583Z 
2025-05-07T19:55:29.0965891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0967322Z 2025-05-07T19:55:29.0968578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0970013Z 2025-05-07T19:55:29.0971502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0972907Z 2025-05-07T19:55:29.0974144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0975554Z 2025-05-07T19:55:29.0976877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0978424Z 2025-05-07T19:55:29.0979810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0981595Z 2025-05-07T19:55:29.0982848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0984322Z 
2025-05-07T19:55:29.0985603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0987313Z 2025-05-07T19:55:29.0988601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0990043Z 2025-05-07T19:55:29.0991340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0992927Z 2025-05-07T19:55:29.1857271Z [244/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:29.1872313Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:29.4510403Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:55:29.4527660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T19:55:29.4529079Z 2025-05-07T19:55:29.4530230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.4531623Z 2025-05-07T19:55:29.4532691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4533976Z 2025-05-07T19:55:29.4535136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4536764Z 2025-05-07T19:55:29.4537872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4539181Z 2025-05-07T19:55:29.4540313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4541616Z 2025-05-07T19:55:29.4542874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.4544514Z 2025-05-07T19:55:29.4545695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:55:29.4547120Z 2025-05-07T19:55:29.4548304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4549633Z 2025-05-07T19:55:29.4550775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4552146Z 2025-05-07T19:55:29.4553463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4554742Z 2025-05-07T19:55:29.4555960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4557367Z 2025-05-07T19:55:29.4558604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.4560008Z 2025-05-07T19:55:29.4561184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.4562551Z 2025-05-07T19:55:29.4563684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:55:29.4565009Z 2025-05-07T19:55:29.4566178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4567482Z 2025-05-07T19:55:29.4568625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4569922Z 2025-05-07T19:55:29.4571082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.4572392Z 2025-05-07T19:55:29.6498581Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:55:29.6515319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.6516738Z 2025-05-07T19:55:29.6517978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:55:29.6519393Z 2025-05-07T19:55:29.6520636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.6522021Z 2025-05-07T19:55:29.6523260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.6524669Z 2025-05-07T19:55:29.6525893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.6527279Z 2025-05-07T19:55:29.6528663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.6530080Z 2025-05-07T19:55:30.4897798Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:55:30.4923092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:55:30.4925190Z 2025-05-07T19:55:30.4927032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.4929140Z 2025-05-07T19:55:30.4930568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.4935326Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.4938912Z (946): here 2025-05-07T19:55:30.4939182Z 2025-05-07T19:55:30.4940924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.4945762Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.4949347Z (996): here 2025-05-07T19:55:30.4949593Z 2025-05-07T19:55:30.4950948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer 
conversion resulted in a change of sign 2025-05-07T19:55:30.4956107Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.4959555Z (1046): here 2025-05-07T19:55:30.4959807Z 2025-05-07T19:55:30.4961163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.4966003Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.4969574Z (1096): here 2025-05-07T19:55:30.4969843Z 2025-05-07T19:55:30.4971254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.4976050Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.4979571Z (1146): here 
2025-05-07T19:55:30.4979818Z 2025-05-07T19:55:30.4981236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.4986006Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.4989429Z (1196): here 2025-05-07T19:55:30.4989681Z 2025-05-07T19:55:30.4991092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.4996193Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.4999732Z (1246): here 2025-05-07T19:55:30.4999983Z 2025-05-07T19:55:30.5001402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5006112Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, 
const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5009900Z (1296): here 2025-05-07T19:55:30.5010148Z 2025-05-07T19:55:30.5011523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5016193Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5019754Z (1346): here 2025-05-07T19:55:30.5020024Z 2025-05-07T19:55:30.5021431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5026254Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5029831Z (1396): here 2025-05-07T19:55:30.5030079Z 2025-05-07T19:55:30.5031494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5036463Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5040058Z (1446): here 2025-05-07T19:55:30.5040304Z 2025-05-07T19:55:30.5041620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5046305Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5049874Z (1496): here 2025-05-07T19:55:30.5050127Z 2025-05-07T19:55:30.5051420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5056142Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5059811Z (1546): here 2025-05-07T19:55:30.5060075Z 2025-05-07T19:55:30.5061381Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5066059Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5069564Z (1596): here 2025-05-07T19:55:30.5069811Z 2025-05-07T19:55:30.5071226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5076073Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5079522Z (1646): here 2025-05-07T19:55:30.5079770Z 2025-05-07T19:55:30.5081167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5086261Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t 
*, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5089863Z (1696): here 2025-05-07T19:55:30.5090113Z 2025-05-07T19:55:30.5091534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5096658Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5100303Z (1746): here 2025-05-07T19:55:30.5100552Z 2025-05-07T19:55:30.5101973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5106833Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5110537Z (1796): here 2025-05-07T19:55:30.5110800Z 2025-05-07T19:55:30.5112185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5117164Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5120779Z (1846): here 2025-05-07T19:55:30.5121027Z 2025-05-07T19:55:30.5122437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5127270Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5130905Z (1896): here 2025-05-07T19:55:30.5131153Z 2025-05-07T19:55:30.5132573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5137423Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5140917Z (1946): here 2025-05-07T19:55:30.5141178Z 2025-05-07T19:55:30.5142579Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5147614Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5151241Z (1996): here 2025-05-07T19:55:30.5151487Z 2025-05-07T19:55:30.5153009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5157843Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5161656Z (2046): here 2025-05-07T19:55:30.5161902Z 2025-05-07T19:55:30.5163317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5168160Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5171601Z (2096): here 2025-05-07T19:55:30.5171851Z 2025-05-07T19:55:30.5173670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.5175752Z 2025-05-07T19:55:30.5177571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.5179644Z 2025-05-07T19:55:30.5181041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5186080Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5189626Z (946): here 2025-05-07T19:55:30.5189900Z 2025-05-07T19:55:30.5191311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5196238Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, 
const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5199490Z (996): here 2025-05-07T19:55:30.5199737Z 2025-05-07T19:55:30.5201365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5206113Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5209591Z (1046): here 2025-05-07T19:55:30.5209818Z 2025-05-07T19:55:30.5211240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5215943Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5219750Z (1096): here 2025-05-07T19:55:30.5220003Z 2025-05-07T19:55:30.5221296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5226012Z detected during instantiation of 
"void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5229462Z (1146): here 2025-05-07T19:55:30.5229750Z 2025-05-07T19:55:30.5231163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5235948Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5239532Z (1196): here 2025-05-07T19:55:30.5239783Z 2025-05-07T19:55:30.5241196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5246014Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5249586Z (1246): here 2025-05-07T19:55:30.5249832Z 2025-05-07T19:55:30.5251237Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5256281Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5259875Z (1296): here 2025-05-07T19:55:30.5260124Z 2025-05-07T19:55:30.5261547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5266006Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5269685Z (1346): here 2025-05-07T19:55:30.5269949Z 2025-05-07T19:55:30.5271349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5276099Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5279693Z (1396): here 2025-05-07T19:55:30.5279895Z 2025-05-07T19:55:30.5281288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5286296Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5289772Z (1446): here 2025-05-07T19:55:30.5290019Z 2025-05-07T19:55:30.5291341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5296352Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5299749Z (1496): here 2025-05-07T19:55:30.5299997Z 2025-05-07T19:55:30.5301405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5306512Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5310114Z (1546): here 2025-05-07T19:55:30.5310360Z 2025-05-07T19:55:30.5311775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5316718Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5320528Z (1596): here 2025-05-07T19:55:30.5320796Z 2025-05-07T19:55:30.5322201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5326913Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5330517Z (1646): here 2025-05-07T19:55:30.5330769Z 2025-05-07T19:55:30.5332190Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5337020Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5340627Z (1696): here 2025-05-07T19:55:30.5340872Z 2025-05-07T19:55:30.5342245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5347059Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5350619Z (1746): here 2025-05-07T19:55:30.5350864Z 2025-05-07T19:55:30.5352280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5357483Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const 
int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5361131Z (1796): here 2025-05-07T19:55:30.5361393Z 2025-05-07T19:55:30.5362782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5367885Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5371667Z (1846): here 2025-05-07T19:55:30.5371927Z 2025-05-07T19:55:30.5373350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5378053Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5381674Z (1896): here 2025-05-07T19:55:30.5381922Z 2025-05-07T19:55:30.5383321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5388465Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5392072Z (1946): here 2025-05-07T19:55:30.5392321Z 2025-05-07T19:55:30.5393854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5398686Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5402308Z (1996): here 2025-05-07T19:55:30.5402526Z 2025-05-07T19:55:30.5403693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5408716Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5412217Z (2046): here 2025-05-07T19:55:30.5412481Z 2025-05-07T19:55:30.5413796Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5418555Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5422381Z (2096): here 2025-05-07T19:55:30.5422631Z 2025-05-07T19:55:30.5424302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.5426316Z 2025-05-07T19:55:30.5428134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.5430121Z 2025-05-07T19:55:30.5431545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5436291Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5439827Z (946): here 
2025-05-07T19:55:30.5440085Z 2025-05-07T19:55:30.5441485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5446267Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5449817Z (996): here 2025-05-07T19:55:30.5450079Z 2025-05-07T19:55:30.5451494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5456309Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5459639Z (1046): here 2025-05-07T19:55:30.5459836Z 2025-05-07T19:55:30.5461243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5466232Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t 
*, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5469639Z (1096): here 2025-05-07T19:55:30.5469889Z 2025-05-07T19:55:30.5471310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5476088Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5479776Z (1146): here 2025-05-07T19:55:30.5480024Z 2025-05-07T19:55:30.5481328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5486277Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5489731Z (1196): here 2025-05-07T19:55:30.5490000Z 2025-05-07T19:55:30.5491404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5510079Z detected during instantiation of "void 
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5513861Z (1246): here 2025-05-07T19:55:30.5514143Z 2025-05-07T19:55:30.5515563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5520391Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5523986Z (1296): here 2025-05-07T19:55:30.5524238Z 2025-05-07T19:55:30.5525667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5530746Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5534311Z (1346): here 2025-05-07T19:55:30.5534579Z 2025-05-07T19:55:30.5535976Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5540796Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5544585Z (1396): here 2025-05-07T19:55:30.5544851Z 2025-05-07T19:55:30.5546261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5551029Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5554760Z (1446): here 2025-05-07T19:55:30.5555011Z 2025-05-07T19:55:30.5556433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5561179Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) 
[with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5564735Z (1496): here 2025-05-07T19:55:30.5564983Z 2025-05-07T19:55:30.5566401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5571095Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5574656Z (1546): here 2025-05-07T19:55:30.5574905Z 2025-05-07T19:55:30.5576320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5581329Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5584906Z (1596): here 2025-05-07T19:55:30.5585173Z 2025-05-07T19:55:30.5586843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5591626Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t 
*, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5595472Z (1646): here 2025-05-07T19:55:30.5595717Z 2025-05-07T19:55:30.5597133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5601940Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5605507Z (1696): here 2025-05-07T19:55:30.5605757Z 2025-05-07T19:55:30.5607152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5611989Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5615577Z (1746): here 2025-05-07T19:55:30.5615824Z 2025-05-07T19:55:30.5617243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning 
#68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5622049Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5625233Z (1796): here 2025-05-07T19:55:30.5625502Z 2025-05-07T19:55:30.5626822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5631928Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5635535Z (1846): here 2025-05-07T19:55:30.5635801Z 2025-05-07T19:55:30.5637091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5641824Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 
2025-05-07T19:55:30.5645605Z (1896): here 2025-05-07T19:55:30.5645853Z 2025-05-07T19:55:30.5647181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5651877Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5655431Z (1946): here 2025-05-07T19:55:30.5655680Z 2025-05-07T19:55:30.5657082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5661780Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5665376Z (1996): here 2025-05-07T19:55:30.5665624Z 2025-05-07T19:55:30.5667044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5671887Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5675638Z (2046): here 2025-05-07T19:55:30.5675904Z 2025-05-07T19:55:30.5677306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:30.5682139Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:30.5686015Z (2096): here 2025-05-07T19:55:30.5686274Z 2025-05-07T19:55:31.5102498Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:55:31.5125459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5127348Z 2025-05-07T19:55:31.5129103Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5131056Z 2025-05-07T19:55:31.5132641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5134450Z 2025-05-07T19:55:31.5136050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5137867Z 2025-05-07T19:55:31.5139430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5141221Z 2025-05-07T19:55:31.5144245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5146090Z 2025-05-07T19:55:31.5147849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5149688Z 2025-05-07T19:55:31.5151203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5153302Z 2025-05-07T19:55:31.5154784Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5156712Z 2025-05-07T19:55:31.5158320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5160097Z 2025-05-07T19:55:31.5161470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5163202Z 2025-05-07T19:55:31.5164685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5166322Z 2025-05-07T19:55:31.5167908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5169622Z 2025-05-07T19:55:31.5171365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5173104Z 2025-05-07T19:55:31.5174588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5175987Z 2025-05-07T19:55:31.5177482Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5179199Z 2025-05-07T19:55:31.5180602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5182435Z 2025-05-07T19:55:31.5183934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5185658Z 2025-05-07T19:55:35.0875730Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:35.0897301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.0899164Z 2025-05-07T19:55:35.0900898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.0902710Z 2025-05-07T19:55:35.0904085Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.0905965Z 2025-05-07T19:55:35.0907520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.0909330Z 2025-05-07T19:55:35.0910916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.0912792Z 2025-05-07T19:55:35.0914408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:35.0916381Z 2025-05-07T19:55:42.5589769Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:55:42.5613540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.5615497Z 2025-05-07T19:55:42.5617223Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.5619142Z 2025-05-07T19:55:42.5620819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.5622890Z 2025-05-07T19:55:42.5624577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.5626507Z 2025-05-07T19:55:42.5628178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.5630055Z 2025-05-07T19:55:42.5631759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:42.5633781Z 2025-05-07T19:55:53.8075856Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:55:53.8095986Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:53.8097848Z 2025-05-07T19:55:53.8099406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:53.8101119Z 2025-05-07T19:55:53.8102690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:53.8104415Z 2025-05-07T19:55:53.8105924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:53.8107505Z 2025-05-07T19:55:53.8108871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:53.8110586Z 2025-05-07T19:55:53.8112030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:53.8113798Z 2025-05-07T19:56:28.7277253Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:28.7301637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.7303694Z 2025-05-07T19:56:28.7305566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.7307505Z 2025-05-07T19:56:28.7309240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.7311299Z 2025-05-07T19:56:28.7313174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.7315098Z 2025-05-07T19:56:28.7316775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.7318676Z 2025-05-07T19:56:28.7320759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:28.7322697Z 2025-05-07T19:56:30.8753176Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:30.8778382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.8780280Z 2025-05-07T19:56:30.8781832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.8783719Z 2025-05-07T19:56:30.8785438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.8787649Z 2025-05-07T19:56:30.8789395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.8791371Z 2025-05-07T19:56:30.8793554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.8795474Z 2025-05-07T19:56:30.8797197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.8799123Z 2025-05-07T19:56:31.1708224Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:56:31.1731251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1733255Z 2025-05-07T19:56:31.1734998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1736918Z 2025-05-07T19:56:31.1738640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1740571Z 2025-05-07T19:56:31.1742322Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1744278Z 2025-05-07T19:56:31.1746317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1748274Z 2025-05-07T19:56:31.1749978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1751914Z 2025-05-07T19:56:33.0028232Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:33.0049956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0051668Z 2025-05-07T19:56:33.0053350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:33.0054910Z 2025-05-07T19:56:33.0056241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0058067Z 2025-05-07T19:56:33.0059996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0061801Z 2025-05-07T19:56:33.0063407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0064846Z 2025-05-07T19:56:33.0065988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0067469Z 2025-05-07T19:56:33.3459521Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:56:33.3483281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.3485245Z 2025-05-07T19:56:33.3487214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.3489385Z 2025-05-07T19:56:33.3491481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.3493431Z 2025-05-07T19:56:33.3495177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.3497144Z 2025-05-07T19:56:33.3498872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.3500845Z 2025-05-07T19:56:33.3502488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.3504333Z 2025-05-07T19:56:34.8115462Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:34.8138913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.8140946Z 2025-05-07T19:56:34.8142925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.8144472Z 2025-05-07T19:56:34.8146123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.8147702Z 2025-05-07T19:56:34.8149125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.8150645Z 2025-05-07T19:56:34.8152066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.8153937Z 2025-05-07T19:56:34.8155754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.8157368Z 
2025-05-07T19:56:35.2297081Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:56:35.2318530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.2320121Z 2025-05-07T19:56:35.2322244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.2324232Z 2025-05-07T19:56:35.2325904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.2327762Z 2025-05-07T19:56:35.2329532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.2331499Z 2025-05-07T19:56:35.2333239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:56:35.2335394Z 2025-05-07T19:56:35.2337148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.2339129Z 2025-05-07T19:56:35.5633145Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:35.5655278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.5656770Z 2025-05-07T19:56:35.5658236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.5659999Z 2025-05-07T19:56:35.5661554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.5663366Z 2025-05-07T19:56:35.5664967Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.5666935Z 2025-05-07T19:56:35.5668588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.5669963Z 2025-05-07T19:56:35.5671254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.5672878Z 2025-05-07T19:56:36.1405579Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:56:36.1427568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.1429320Z 2025-05-07T19:56:36.1430652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:56:36.1432276Z 2025-05-07T19:56:36.1433990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.1436011Z 2025-05-07T19:56:36.1437480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.1439138Z 2025-05-07T19:56:36.1440606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.1442347Z 2025-05-07T19:56:36.1443892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.1445700Z 2025-05-07T19:56:36.1447388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.1449147Z 2025-05-07T19:56:36.1450620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.1452391Z 2025-05-07T19:56:36.1453839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.1455552Z 2025-05-07T19:56:36.1457177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.1458890Z 2025-05-07T19:56:36.1460322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.1462086Z 2025-05-07T19:56:36.1463223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.1464616Z 2025-05-07T19:56:36.1465760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.1467553Z 2025-05-07T19:56:36.1469442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.1471030Z 2025-05-07T19:56:36.1472645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:36.1474189Z 2025-05-07T19:56:36.3619772Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:56:36.3641050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.3642734Z 2025-05-07T19:56:36.3644051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.3645723Z 2025-05-07T19:56:36.3647200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.3648873Z 2025-05-07T19:56:36.3650551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.3652360Z 2025-05-07T19:56:36.3654399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.3656334Z 2025-05-07T19:56:36.3657883Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.3659711Z 2025-05-07T19:56:36.8970463Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:56:36.8992748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.8994403Z 2025-05-07T19:56:36.8995933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.8997851Z 2025-05-07T19:56:36.8999341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.9001101Z 2025-05-07T19:56:36.9002940Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.9004727Z 2025-05-07T19:56:36.9006268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.9008023Z 2025-05-07T19:56:36.9009617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:36.9011059Z 2025-05-07T19:56:38.4137403Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:56:38.4163291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.4165214Z 2025-05-07T19:56:38.4166933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.4168889Z 2025-05-07T19:56:38.4170276Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:38.4171870Z 2025-05-07T19:56:38.4173939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.4175849Z 2025-05-07T19:56:38.4177552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.4179477Z 2025-05-07T19:56:38.4180839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:38.4182439Z 2025-05-07T19:56:38.4184516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.4186728Z 2025-05-07T19:56:38.4188627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:38.4190606Z 2025-05-07T19:56:39.3497756Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:39.3521206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.3523154Z 2025-05-07T19:56:39.3524837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.3526788Z 2025-05-07T19:56:39.3528457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.3530274Z 2025-05-07T19:56:39.3531940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.3534063Z 2025-05-07T19:56:39.3535731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.3537648Z 2025-05-07T19:56:39.3539267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.3541080Z 2025-05-07T19:56:40.7397585Z 
[265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 
-D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:40.7427800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.7430158Z 2025-05-07T19:56:40.7432274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.7434783Z 2025-05-07T19:56:40.7436767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.7439419Z 2025-05-07T19:56:40.7441502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.7443874Z 2025-05-07T19:56:40.7445996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.7448344Z 2025-05-07T19:56:40.7450434Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:40.7452827Z 2025-05-07T19:56:42.3500333Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:42.3523719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3525399Z 2025-05-07T19:56:42.3527188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3529206Z 2025-05-07T19:56:42.3530787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3533013Z 2025-05-07T19:56:42.3534727Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3536422Z 2025-05-07T19:56:42.3538160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3540119Z 2025-05-07T19:56:42.3541880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.3543925Z 2025-05-07T19:56:44.0476587Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:44.0499830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.0501868Z 2025-05-07T19:56:44.0503639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.0506011Z 
2025-05-07T19:56:44.0507740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.0509663Z 2025-05-07T19:56:44.0511326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.0513287Z 2025-05-07T19:56:44.0515012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.0517010Z 2025-05-07T19:56:44.0518758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.0520730Z 2025-05-07T19:56:44.2989765Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:44.3010231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:44.3012200Z 2025-05-07T19:56:44.3013726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.3015418Z 2025-05-07T19:56:44.3016887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.3018555Z 2025-05-07T19:56:44.3020055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.3021744Z 2025-05-07T19:56:44.3023202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.3024921Z 2025-05-07T19:56:44.3026380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.3027805Z 2025-05-07T19:56:44.3967103Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:56:44.3990252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.3992413Z 2025-05-07T19:56:44.3994204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.3996149Z 2025-05-07T19:56:44.3997843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.3999732Z 2025-05-07T19:56:44.4001168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.4003107Z 2025-05-07T19:56:44.4004830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.4006761Z 2025-05-07T19:56:44.4008454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.4010392Z 2025-05-07T19:56:44.5420964Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:56:44.5441832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.5443484Z 2025-05-07T19:56:44.5444975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.5446623Z 2025-05-07T19:56:44.5448087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.5449739Z 2025-05-07T19:56:44.5451231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.5452883Z 2025-05-07T19:56:44.5454354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.5455976Z 2025-05-07T19:56:44.5457448Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.5459108Z 2025-05-07T19:56:45.2905614Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:45.2927221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.2929084Z 2025-05-07T19:56:45.2930718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.2932523Z 2025-05-07T19:56:45.2934132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.2935890Z 2025-05-07T19:56:45.2937528Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.2939368Z 2025-05-07T19:56:45.2941029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.2942828Z 2025-05-07T19:56:45.2944387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.2945990Z 2025-05-07T19:56:45.6726945Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:45.6749957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.6751827Z 2025-05-07T19:56:45.6753625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.6755534Z 
2025-05-07T19:56:45.6757165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.6759052Z 2025-05-07T19:56:45.6760747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.6762642Z 2025-05-07T19:56:45.6764283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.6766148Z 2025-05-07T19:56:45.6767828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.6769363Z 2025-05-07T19:56:45.8517134Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:45.8540400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:45.8542340Z 2025-05-07T19:56:45.8544087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.8546003Z 2025-05-07T19:56:45.8547610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.8549395Z 2025-05-07T19:56:45.8551084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.8553143Z 2025-05-07T19:56:45.8554809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.8556749Z 2025-05-07T19:56:45.8558458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.8560415Z 2025-05-07T19:56:46.5780348Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:46.5804593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.5806589Z 2025-05-07T19:56:46.5808294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.5810270Z 2025-05-07T19:56:46.5811927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.5813862Z 2025-05-07T19:56:46.5815593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.5817528Z 2025-05-07T19:56:46.5819233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.5821101Z 2025-05-07T19:56:46.5822601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:46.5824430Z 2025-05-07T19:56:46.6224194Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:56:46.6244116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6245502Z 2025-05-07T19:56:46.6246794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6248428Z 2025-05-07T19:56:46.6249834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6251420Z 2025-05-07T19:56:46.6252925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6254519Z 2025-05-07T19:56:46.6255923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:56:46.6257540Z 2025-05-07T19:56:46.6258943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6260428Z 2025-05-07T19:56:47.6508613Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:47.6532591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.6534551Z 2025-05-07T19:56:47.6536300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.6538165Z 2025-05-07T19:56:47.6539779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.6541684Z 2025-05-07T19:56:47.6543337Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.6545068Z 2025-05-07T19:56:47.6546676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.6548486Z 2025-05-07T19:56:47.6550133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:47.6551928Z 2025-05-07T19:56:48.8754023Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:48.8775056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.8776767Z 2025-05-07T19:56:48.8778265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.8779893Z 2025-05-07T19:56:48.8781471Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.8783126Z 2025-05-07T19:56:48.8784616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.8786595Z 2025-05-07T19:56:48.8788117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.8789810Z 2025-05-07T19:56:48.8791626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.8793486Z 2025-05-07T19:56:52.5447173Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:56:52.5470871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:52.5472967Z 2025-05-07T19:56:52.5474691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5476658Z 2025-05-07T19:56:52.5478329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5479810Z 2025-05-07T19:56:52.5481378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5483266Z 2025-05-07T19:56:52.5484909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5487242Z 2025-05-07T19:56:52.5488918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5490809Z 2025-05-07T19:56:52.5756953Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:52.5780397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5782387Z 2025-05-07T19:56:52.5784031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5786132Z 2025-05-07T19:56:52.5787808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5789568Z 2025-05-07T19:56:52.5791200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5793329Z 2025-05-07T19:56:52.5795287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5797232Z 2025-05-07T19:56:52.5798950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.5800901Z 2025-05-07T19:56:53.4083605Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:56:53.4107651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4109602Z 2025-05-07T19:56:53.4111294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4113289Z 2025-05-07T19:56:53.4114885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4116758Z 2025-05-07T19:56:53.4118927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4120889Z 2025-05-07T19:56:53.4122585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4124520Z 2025-05-07T19:56:53.4126228Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4128161Z 2025-05-07T19:56:54.5263151Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:54.5287548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5289443Z 2025-05-07T19:56:54.5291197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5293094Z 2025-05-07T19:56:54.5295049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5296974Z 2025-05-07T19:56:54.5298656Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5300587Z 2025-05-07T19:56:54.5302224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5304096Z 2025-05-07T19:56:54.5305826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5307979Z 2025-05-07T19:56:55.1723149Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:56:55.1745803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.1747762Z 2025-05-07T19:56:55.1749421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.1751304Z 2025-05-07T19:56:55.1753189Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:55.1754748Z 2025-05-07T19:56:55.1756372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.1758221Z 2025-05-07T19:56:55.1759916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.1761738Z 2025-05-07T19:56:55.1763123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:55.1764962Z 2025-05-07T19:56:55.1766704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.1768634Z 2025-05-07T19:56:55.1770357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:55.1772236Z 2025-05-07T19:56:56.4362818Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:56:56.4386859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4388850Z 2025-05-07T19:56:56.4390571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4392472Z 2025-05-07T19:56:56.4394038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:56.4395951Z 2025-05-07T19:56:56.4397377Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:56.4399035Z 2025-05-07T19:56:56.4400324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:56.4401893Z 2025-05-07T19:56:56.4403334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:56.4404973Z 2025-05-07T19:56:56.4406311Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:56.4408025Z 2025-05-07T19:56:56.4409367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:56.4410676Z 2025-05-07T19:56:56.4411683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:56.4412763Z 2025-05-07T19:56:56.4413779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:56.4414997Z 2025-05-07T19:56:56.4416359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4417783Z 2025-05-07T19:56:56.4419073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4420802Z 2025-05-07T19:56:56.4422061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:56.4423572Z 2025-05-07T19:56:56.4424875Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:56.4426246Z 2025-05-07T19:56:56.4427515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:56.4429415Z 2025-05-07T19:56:56.4430767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:56.4432360Z 2025-05-07T19:56:56.4433800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:56.4435393Z 2025-05-07T19:56:56.4436713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:56.4438386Z 2025-05-07T19:56:56.4439666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:56.4441324Z 2025-05-07T19:56:56.4442597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:56.4444251Z 2025-05-07T19:56:56.4445942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4447827Z 2025-05-07T19:56:56.4449566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4451499Z 2025-05-07T19:56:56.4753441Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:56:56.4776258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4778082Z 2025-05-07T19:56:56.4779740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4782054Z 2025-05-07T19:56:56.4783797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4785647Z 2025-05-07T19:56:56.4787742Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4789670Z 2025-05-07T19:56:56.4791425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4793475Z 2025-05-07T19:56:56.4795191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.4797094Z 2025-05-07T19:57:02.4469416Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:02.4492655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.4494851Z 2025-05-07T19:57:02.4496576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:02.4498486Z 2025-05-07T19:57:02.4500113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.4501956Z 2025-05-07T19:57:02.4503629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.4505535Z 2025-05-07T19:57:02.4507198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.4509145Z 2025-05-07T19:57:02.4510848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.4512856Z 2025-05-07T19:57:03.4815144Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:03.4839018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.4841266Z 2025-05-07T19:57:03.4843043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.4845000Z 2025-05-07T19:57:03.4846658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.4848498Z 2025-05-07T19:57:03.4850131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.4852083Z 2025-05-07T19:57:03.4853724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.4855668Z 2025-05-07T19:57:03.4857376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.4859315Z 2025-05-07T19:57:04.0573284Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:04.0595166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.0596539Z 2025-05-07T19:57:04.0597997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.0599628Z 2025-05-07T19:57:04.0601129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.0602733Z 2025-05-07T19:57:04.0604260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.0606169Z 2025-05-07T19:57:04.0607805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.0609636Z 2025-05-07T19:57:04.0611189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:04.0613077Z 2025-05-07T19:57:04.5470294Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T19:57:04.5493918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5495862Z 2025-05-07T19:57:04.5497561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5499442Z 2025-05-07T19:57:04.5501067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5502788Z 2025-05-07T19:57:04.5504365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5506153Z 2025-05-07T19:57:04.5507667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of 
unsigned integer with zero 2025-05-07T19:57:04.5509497Z 2025-05-07T19:57:04.5511082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5513021Z 2025-05-07T19:57:04.5514762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5516688Z 2025-05-07T19:57:04.5518415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5520364Z 2025-05-07T19:57:04.5521950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5523668Z 2025-05-07T19:57:04.5525555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5527367Z 2025-05-07T19:57:04.5528962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5530769Z 2025-05-07T19:57:04.5532319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer 
with zero 2025-05-07T19:57:04.5534122Z 2025-05-07T19:57:04.5535826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5537909Z 2025-05-07T19:57:04.5539658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5541434Z 2025-05-07T19:57:04.5542946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5544770Z 2025-05-07T19:57:04.5546328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5548113Z 2025-05-07T19:57:04.5549702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5551535Z 2025-05-07T19:57:04.5553251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:04.5555062Z 2025-05-07T19:57:06.4908437Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:57:06.4931313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.4933447Z 2025-05-07T19:57:06.4935129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.4937009Z 2025-05-07T19:57:06.4938473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4940058Z 2025-05-07T19:57:06.4941565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4943383Z 2025-05-07T19:57:06.4944959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4946645Z 2025-05-07T19:57:06.4948132Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4949903Z 2025-05-07T19:57:06.4951573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.4953554Z 2025-05-07T19:57:06.4955083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.4957011Z 2025-05-07T19:57:06.4958753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4960447Z 2025-05-07T19:57:06.4961879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4963640Z 2025-05-07T19:57:06.4965198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4966934Z 2025-05-07T19:57:06.4968638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4970382Z 2025-05-07T19:57:06.4972015Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.4973881Z 2025-05-07T19:57:06.4975457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.4977233Z 2025-05-07T19:57:06.4978794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4980827Z 2025-05-07T19:57:06.4982286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4983916Z 2025-05-07T19:57:06.4985387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4987301Z 2025-05-07T19:57:06.4988761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:06.4990454Z 2025-05-07T19:57:07.5246529Z [290/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 
-DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:57:10.4307443Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:10.4328415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4330155Z 2025-05-07T19:57:10.4331673Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4333463Z 2025-05-07T19:57:10.4334983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4336707Z 2025-05-07T19:57:10.4338237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4339797Z 2025-05-07T19:57:10.4341021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4342585Z 2025-05-07T19:57:10.4343939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.4345449Z 2025-05-07T19:57:15.6439810Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:15.6461012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.6462879Z 2025-05-07T19:57:15.6464515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.6466418Z 2025-05-07T19:57:15.6468139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.6470045Z 2025-05-07T19:57:15.6471717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.6473663Z 2025-05-07T19:57:15.6475162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.6476782Z 2025-05-07T19:57:15.6478370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.6480193Z 2025-05-07T19:57:20.4555152Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:20.4578110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.4580120Z 2025-05-07T19:57:20.4581787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.4583707Z 2025-05-07T19:57:20.4585382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.4587801Z 2025-05-07T19:57:20.4589371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.4591245Z 2025-05-07T19:57:20.4592999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.4594674Z 2025-05-07T19:57:20.4596348Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.4598189Z 2025-05-07T19:57:21.7049754Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:21.7073961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.7075915Z 2025-05-07T19:57:21.7077443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.7079116Z 2025-05-07T19:57:21.7080445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.7082139Z 2025-05-07T19:57:21.7083837Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.7086000Z 2025-05-07T19:57:21.7087681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.7089552Z 2025-05-07T19:57:21.7091262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:21.7093163Z 2025-05-07T19:57:23.7675284Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:23.7699115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.7700933Z 2025-05-07T19:57:23.7702700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.7704681Z 2025-05-07T19:57:23.7706447Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.7708412Z 2025-05-07T19:57:23.7710160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.7712092Z 2025-05-07T19:57:23.7713804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.7715567Z 2025-05-07T19:57:23.7717202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.7719197Z 2025-05-07T19:57:23.9022298Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:23.9045641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.9047616Z 
2025-05-07T19:57:23.9049363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.9051384Z 2025-05-07T19:57:23.9053139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.9055109Z 2025-05-07T19:57:23.9056877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.9058852Z 2025-05-07T19:57:23.9064830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.9066816Z 2025-05-07T19:57:23.9068708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.9070652Z 2025-05-07T19:57:25.4433376Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:25.4457384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.4459364Z 2025-05-07T19:57:25.4460764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.4462523Z 2025-05-07T19:57:25.4464089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.4465878Z 2025-05-07T19:57:25.4467356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.4469615Z 2025-05-07T19:57:25.4471326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.4473472Z 2025-05-07T19:57:25.4475443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.4477186Z 2025-05-07T19:57:27.0049802Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:27.0068300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.0069962Z 2025-05-07T19:57:27.0071264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.0072893Z 2025-05-07T19:57:27.0074145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.0075869Z 2025-05-07T19:57:27.0077197Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.0078734Z 2025-05-07T19:57:27.0080319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.0081824Z 2025-05-07T19:57:27.0083121Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:27.0084641Z 2025-05-07T19:57:29.7799077Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:29.7822203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7824089Z 2025-05-07T19:57:29.7825781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7827612Z 2025-05-07T19:57:29.7829415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7831189Z 2025-05-07T19:57:29.7833168Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7835000Z 2025-05-07T19:57:29.7836512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7838062Z 2025-05-07T19:57:29.7839550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.7841395Z 2025-05-07T19:57:29.9455279Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:29.9479321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.9481381Z 2025-05-07T19:57:29.9483150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.9485423Z 
2025-05-07T19:57:29.9487394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.9489428Z 2025-05-07T19:57:29.9491450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.9493438Z 2025-05-07T19:57:29.9495152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.9497101Z 2025-05-07T19:57:29.9498871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.9500903Z 2025-05-07T19:57:30.1316744Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:30.1340173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:30.1342503Z 2025-05-07T19:57:30.1344157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.1346186Z 2025-05-07T19:57:30.1348230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.1350099Z 2025-05-07T19:57:30.1351813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.1353980Z 2025-05-07T19:57:30.1355652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.1357506Z 2025-05-07T19:57:30.1359243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.1361415Z 2025-05-07T19:57:30.3206679Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:30.3229118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.3230860Z 2025-05-07T19:57:30.3233030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.3234872Z 2025-05-07T19:57:30.3236408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.3238206Z 2025-05-07T19:57:30.3239751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.3241590Z 2025-05-07T19:57:30.3243233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.3245256Z 2025-05-07T19:57:30.3246915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.3248761Z 2025-05-07T19:57:31.8887836Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:31.8908625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8910447Z 2025-05-07T19:57:31.8912391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8914244Z 2025-05-07T19:57:31.8915773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8917575Z 2025-05-07T19:57:31.8919063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8921024Z 2025-05-07T19:57:31.8922642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8924289Z 2025-05-07T19:57:31.8925865Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8927555Z 2025-05-07T19:57:31.9527560Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:31.9548698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.9550433Z 2025-05-07T19:57:31.9551973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.9553791Z 2025-05-07T19:57:31.9555276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.9556992Z 2025-05-07T19:57:31.9558514Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.9560447Z 2025-05-07T19:57:31.9561901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.9563621Z 2025-05-07T19:57:31.9565099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.9566805Z 2025-05-07T19:57:33.5202443Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:33.5223730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.5225623Z 2025-05-07T19:57:33.5227076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.5228539Z 
2025-05-07T19:57:33.5229972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.5231966Z 2025-05-07T19:57:33.5233521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.5235199Z 2025-05-07T19:57:33.5236698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.5238429Z 2025-05-07T19:57:33.5240028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:33.5241749Z 2025-05-07T19:57:34.2185513Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:34.2209704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:34.2211691Z 2025-05-07T19:57:34.2229363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.2231742Z 2025-05-07T19:57:34.2233668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.2235655Z 2025-05-07T19:57:34.2237450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.2239433Z 2025-05-07T19:57:34.2241081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.2242932Z 2025-05-07T19:57:34.2244554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.2246422Z 2025-05-07T19:57:35.3329654Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:35.3353009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3355016Z 2025-05-07T19:57:35.3356750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3358973Z 2025-05-07T19:57:35.3360696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3362620Z 2025-05-07T19:57:35.3364355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3366303Z 2025-05-07T19:57:35.3368004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3369869Z 2025-05-07T19:57:35.3371590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3373444Z 2025-05-07T19:57:35.3526058Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:35.3550027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3552213Z 2025-05-07T19:57:35.3554105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3556079Z 2025-05-07T19:57:35.3557669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3559603Z 2025-05-07T19:57:35.3561327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3563173Z 2025-05-07T19:57:35.3564874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3566812Z 2025-05-07T19:57:35.3568524Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3570445Z 2025-05-07T19:57:35.6066036Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:35.6087251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6089091Z 2025-05-07T19:57:35.6090789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6092661Z 2025-05-07T19:57:35.6094219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6096026Z 2025-05-07T19:57:35.6097744Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6099433Z 2025-05-07T19:57:35.6101169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6102799Z 2025-05-07T19:57:35.6104081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.6105861Z 2025-05-07T19:57:35.8781330Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:35.8805187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.8807262Z 2025-05-07T19:57:35.8809035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.8811040Z 
2025-05-07T19:57:35.8812786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.8814763Z 2025-05-07T19:57:35.8816550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.8818525Z 2025-05-07T19:57:35.8820134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.8821969Z 2025-05-07T19:57:35.8823726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.8825725Z 2025-05-07T19:57:36.1480181Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:57:36.1503350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1505336Z 2025-05-07T19:57:36.1507017Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1508977Z 2025-05-07T19:57:36.1510657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1512482Z 2025-05-07T19:57:36.1514323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1516227Z 2025-05-07T19:57:36.1517805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1519740Z 2025-05-07T19:57:36.1521414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1523271Z 2025-05-07T19:57:37.6301909Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:37.6324480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.6326265Z 2025-05-07T19:57:37.6327834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.6329643Z 2025-05-07T19:57:37.6331140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.6332680Z 2025-05-07T19:57:37.6334078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.6335562Z 2025-05-07T19:57:37.6336887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.6338344Z 2025-05-07T19:57:37.6339676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:37.6341363Z 2025-05-07T19:57:38.8264500Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:57:38.8286193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.8287953Z 2025-05-07T19:57:38.8289511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.8291240Z 2025-05-07T19:57:38.8292702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.8294208Z 2025-05-07T19:57:38.8295680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.8297250Z 2025-05-07T19:57:38.8298716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.8300282Z 2025-05-07T19:57:38.8301714Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.8303281Z 2025-05-07T19:57:38.9865186Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:38.9888536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.9890327Z 2025-05-07T19:57:38.9891981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.9893759Z 2025-05-07T19:57:38.9895367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.9897183Z 2025-05-07T19:57:38.9898537Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.9900116Z 2025-05-07T19:57:38.9901475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.9903121Z 2025-05-07T19:57:38.9904550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.9906152Z 2025-05-07T19:57:40.9622042Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:40.9643595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.9645505Z 2025-05-07T19:57:40.9647140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.9648926Z 
2025-05-07T19:57:40.9650491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.9652177Z 2025-05-07T19:57:40.9653561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.9655063Z 2025-05-07T19:57:40.9656480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.9658222Z 2025-05-07T19:57:40.9659710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.9661432Z 2025-05-07T19:57:42.3233735Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:57:42.3254084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.3255864Z 2025-05-07T19:57:42.3257372Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.3259171Z 2025-05-07T19:57:42.3260726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.3262381Z 2025-05-07T19:57:42.3264108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.3265914Z 2025-05-07T19:57:42.3267467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.3269143Z 2025-05-07T19:57:42.3270809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.3273044Z 2025-05-07T19:57:42.4186877Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T19:57:42.4209129Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.4211047Z 2025-05-07T19:57:42.4212780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.4214813Z 2025-05-07T19:57:42.4216464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.4218356Z 2025-05-07T19:57:42.4219745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.4221547Z 2025-05-07T19:57:42.4223180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.4225135Z 2025-05-07T19:57:42.4226661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:42.4228437Z 2025-05-07T19:57:43.2608274Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x 
cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:43.2632367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.2634375Z 2025-05-07T19:57:43.2636026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.2637988Z 2025-05-07T19:57:43.2639691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.2641596Z 2025-05-07T19:57:43.2643327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.2645272Z 2025-05-07T19:57:43.2646922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.2649292Z 2025-05-07T19:57:43.2651154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.2653182Z 
2025-05-07T19:57:44.9319993Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:57:44.9340704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9342492Z 2025-05-07T19:57:44.9344128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9345874Z 2025-05-07T19:57:44.9347243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9348572Z 2025-05-07T19:57:44.9349866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9351874Z 2025-05-07T19:57:44.9353500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9355140Z 2025-05-07T19:57:44.9357907Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:44.9359737Z 2025-05-07T19:57:46.3476444Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:57:46.3498973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3500910Z 2025-05-07T19:57:46.3502655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3504628Z 2025-05-07T19:57:46.3506199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3508098Z 2025-05-07T19:57:46.3510079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3512029Z 2025-05-07T19:57:46.3514050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3515946Z 2025-05-07T19:57:46.3517722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3519723Z 2025-05-07T19:57:46.6580665Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:46.6604494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6606409Z 2025-05-07T19:57:46.6608118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6609994Z 2025-05-07T19:57:46.6611594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6613717Z 2025-05-07T19:57:46.6615446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6617419Z 2025-05-07T19:57:46.6619390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6621321Z 2025-05-07T19:57:46.6623023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6625135Z 2025-05-07T19:57:46.9657032Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:57:46.9678747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.9680552Z 2025-05-07T19:57:46.9682142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.9684095Z 2025-05-07T19:57:46.9685616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.9687592Z 2025-05-07T19:57:46.9689360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.9691096Z 2025-05-07T19:57:46.9692565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.9694291Z 2025-05-07T19:57:46.9695745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.9697398Z 2025-05-07T19:57:47.7818751Z [323/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:47.7842682Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.7844634Z 2025-05-07T19:57:47.7846400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.7848570Z 2025-05-07T19:57:47.7851631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.7853635Z 2025-05-07T19:57:47.7855413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.7857385Z 2025-05-07T19:57:47.7859090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.7861022Z 2025-05-07T19:57:47.7862659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.7864695Z 2025-05-07T19:57:54.0183598Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu 
-c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:57:54.0207289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:54.0209223Z 2025-05-07T19:57:54.0210464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:54.0211769Z 2025-05-07T19:57:54.0213255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:54.0214697Z 2025-05-07T19:57:54.0215858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:54.0217263Z 2025-05-07T19:57:54.0218687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:54.0220275Z 2025-05-07T19:57:54.0221800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:54.0223540Z 
2025-05-07T19:57:58.4571096Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T19:57:58.4594476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:58.4596806Z 2025-05-07T19:57:58.4598734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:58.4600665Z 2025-05-07T19:57:58.4602365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:58.4604188Z 2025-05-07T19:57:58.4605938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:58.4607878Z 2025-05-07T19:57:58.4609576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:58.4611598Z 2025-05-07T19:57:58.4613272Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:58.4615146Z 2025-05-07T19:58:24.3400172Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:24.3417841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.3419294Z 2025-05-07T19:58:24.3420855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.3422301Z 2025-05-07T19:58:24.3423551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.3424947Z 2025-05-07T19:58:24.3426227Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.3427669Z 2025-05-07T19:58:24.3429117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.3430535Z 2025-05-07T19:58:24.3431805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.3433390Z 2025-05-07T19:58:25.6726597Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T19:58:25.6749338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6751297Z 2025-05-07T19:58:25.6753166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6755078Z 2025-05-07T19:58:25.6756726Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6758599Z 2025-05-07T19:58:25.6760280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6762286Z 2025-05-07T19:58:25.6763944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6765794Z 2025-05-07T19:58:25.6767423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.6769328Z 2025-05-07T19:58:25.9746688Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:25.9769124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:25.9771063Z 2025-05-07T19:58:25.9772844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.9774621Z 2025-05-07T19:58:25.9776232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.9778462Z 2025-05-07T19:58:25.9780135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.9781929Z 2025-05-07T19:58:25.9783591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.9785518Z 2025-05-07T19:58:25.9787547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.9789488Z 2025-05-07T19:58:26.2605709Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:26.2628556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2630494Z 2025-05-07T19:58:26.2632205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2634271Z 2025-05-07T19:58:26.2635930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2638078Z 2025-05-07T19:58:26.2639743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2641676Z 2025-05-07T19:58:26.2643360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2645236Z 2025-05-07T19:58:26.2646917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.2648809Z 2025-05-07T19:58:37.7909236Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T19:58:37.7931939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7933871Z 2025-05-07T19:58:37.7935594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7937698Z 2025-05-07T19:58:37.7939298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7941188Z 2025-05-07T19:58:37.7942726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7944765Z 2025-05-07T19:58:37.7946440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7948311Z 2025-05-07T19:58:37.7949975Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.7951896Z 2025-05-07T19:58:39.4298786Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:39.4315640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.4317086Z 2025-05-07T19:58:39.4318323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.4320193Z 2025-05-07T19:58:39.4321716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.4323172Z 2025-05-07T19:58:39.4324378Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.4325852Z 2025-05-07T19:58:39.4327500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.4329703Z 2025-05-07T19:58:39.4331437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.4333543Z 2025-05-07T19:58:39.8843675Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:39.8873651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8876477Z 2025-05-07T19:58:39.8878555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:39.8880993Z 2025-05-07T19:58:39.8883208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8885996Z 2025-05-07T19:58:39.8888145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8890526Z 2025-05-07T19:58:39.8892592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8894939Z 2025-05-07T19:58:39.8897070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.8899505Z 2025-05-07T19:58:39.9010104Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T19:58:39.9037562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.9040108Z 2025-05-07T19:58:39.9042298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.9044671Z 2025-05-07T19:58:39.9046744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.9049120Z 2025-05-07T19:58:39.9051230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.9053574Z 2025-05-07T19:58:39.9055672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.9057995Z 2025-05-07T19:58:39.9060145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.9062580Z 2025-05-07T19:58:40.4929729Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:40.4951457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.4953522Z 2025-05-07T19:58:40.4955293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.4957239Z 2025-05-07T19:58:40.4958870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.4960881Z 2025-05-07T19:58:40.4962652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.4964643Z 2025-05-07T19:58:40.4966289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.4968172Z 2025-05-07T19:58:40.4969944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:40.4971953Z 2025-05-07T19:58:43.0081802Z [335/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T19:58:43.0104354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.0106215Z 2025-05-07T19:58:43.0107867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.0109721Z 2025-05-07T19:58:43.0111352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.0113309Z 2025-05-07T19:58:43.0114926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.0116761Z 2025-05-07T19:58:43.0118346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.0120188Z 
2025-05-07T19:58:43.0121814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.0123662Z 2025-05-07T19:58:44.2849823Z [336/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:44.2872062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2873916Z 2025-05-07T19:58:44.2875495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2877384Z 2025-05-07T19:58:44.2878965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2880797Z 2025-05-07T19:58:44.2882529Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2884465Z 2025-05-07T19:58:44.2886407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2888125Z 2025-05-07T19:58:44.2889655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2891579Z 2025-05-07T19:58:45.3710949Z [337/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:45.3727606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.3729014Z 2025-05-07T19:58:45.3730184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.3731478Z 
2025-05-07T19:58:45.3732608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.3733893Z 2025-05-07T19:58:45.3735050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.3736350Z 2025-05-07T19:58:45.3737526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.3738808Z 2025-05-07T19:58:45.3739968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:45.3741290Z 2025-05-07T19:58:47.2901564Z [338/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T19:58:47.2925548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2927554Z 2025-05-07T19:58:47.2929353Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2931314Z 2025-05-07T19:58:47.2933033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2934984Z 2025-05-07T19:58:47.2936732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2938853Z 2025-05-07T19:58:47.2940582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2942539Z 2025-05-07T19:58:47.2944266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2946265Z 2025-05-07T19:58:47.9980337Z [339/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:58:48.0003831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.0005797Z 2025-05-07T19:58:48.0007250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.0008934Z 2025-05-07T19:58:48.0010289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.0011924Z 2025-05-07T19:58:48.0013360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.0014996Z 2025-05-07T19:58:48.0016464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.0018137Z 2025-05-07T19:58:48.0019724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.0021828Z 2025-05-07T19:58:48.2084409Z [340/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:58:48.2102822Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:48.2838766Z [341/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:58:48.2854925Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:48.9550865Z [342/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:48.9571048Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:49.3604306Z [343/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T19:58:49.3624651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:58:49.3626159Z 2025-05-07T19:58:49.3627463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.3629051Z 2025-05-07T19:58:49.3630387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.3631885Z 2025-05-07T19:58:49.3633437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.3635009Z 2025-05-07T19:58:49.3636366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.3637888Z 2025-05-07T19:58:49.3639141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.3640779Z 2025-05-07T19:58:49.6231472Z [344/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:49.6254896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.6256560Z 2025-05-07T19:58:49.6258187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.6259870Z 2025-05-07T19:58:49.6261384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.6263012Z 2025-05-07T19:58:49.6264543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.6266057Z 2025-05-07T19:58:49.6267664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.6269553Z 2025-05-07T19:58:49.6271241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.6273316Z 2025-05-07T19:58:49.7493969Z [345/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T19:58:49.7516719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.7518613Z 2025-05-07T19:58:49.7520283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.7522192Z 2025-05-07T19:58:49.7523760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.7525650Z 2025-05-07T19:58:49.7527329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.7529282Z 2025-05-07T19:58:49.7530786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.7532507Z 2025-05-07T19:58:49.7534039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.7536050Z 2025-05-07T19:58:51.4137298Z [346/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:58:51.4158443Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:52.5404155Z [347/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:52.5425910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.5427619Z 2025-05-07T19:58:52.5429219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.5431112Z 2025-05-07T19:58:52.5432887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.5435011Z 2025-05-07T19:58:52.5436568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.5438316Z 2025-05-07T19:58:52.5439975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.5441739Z 2025-05-07T19:58:52.5443245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.5444953Z 2025-05-07T19:58:52.7674663Z [348/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:58:52.7696186Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:52.8264633Z [349/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:52.8286867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8288696Z 2025-05-07T19:58:52.8290168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8291820Z 2025-05-07T19:58:52.8293307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8295275Z 2025-05-07T19:58:52.8296729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8298482Z 2025-05-07T19:58:52.8300159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8301670Z 2025-05-07T19:58:52.8303111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8304790Z 2025-05-07T19:58:54.0262714Z [350/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:54.0282434Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:55.6232454Z [351/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma 
-fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:58:55.6250142Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:56.0826454Z [352/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:56.0842204Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:56.3476623Z [353/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:58:56.3495110Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:56.4074780Z [354/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:56.4093766Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:57.2584282Z [355/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 
2025-05-07T19:58:57.2600612Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:57.6405178Z [356/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:57.6421836Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:59.0834333Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:59.0855160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.0856753Z 2025-05-07T19:58:59.0858086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.0859840Z 2025-05-07T19:58:59.0861167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:58:59.0862992Z 2025-05-07T19:58:59.0864654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.0866270Z 2025-05-07T19:58:59.6656235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.6658119Z 2025-05-07T19:58:59.6659718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.6661616Z 2025-05-07T19:58:59.6681982Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T19:58:59.6705303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.6707480Z 2025-05-07T19:58:59.6709225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.6710963Z 2025-05-07T19:58:59.6712974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.6714808Z 2025-05-07T19:58:59.6716406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.6718198Z 2025-05-07T19:58:59.6719763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.6721530Z 2025-05-07T19:58:59.6723100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:59.6725103Z 2025-05-07T19:59:04.8688127Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T19:59:04.8711256Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8713215Z 2025-05-07T19:59:04.8715319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8717073Z 2025-05-07T19:59:04.8718535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8720314Z 2025-05-07T19:59:04.8721989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8723826Z 2025-05-07T19:59:04.8725433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8727614Z 2025-05-07T19:59:04.8729268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:04.8731235Z 2025-05-07T19:59:05.7858533Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T19:59:05.7879175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.7881202Z 2025-05-07T19:59:05.7882755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.7884555Z 2025-05-07T19:59:05.7886399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.7888212Z 2025-05-07T19:59:05.7889875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.7891908Z 2025-05-07T19:59:05.7893535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.7895103Z 2025-05-07T19:59:05.7896542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.7898194Z 2025-05-07T19:59:06.2323302Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T19:59:06.2345005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.2346873Z 2025-05-07T19:59:06.2348331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.2350017Z 2025-05-07T19:59:06.2351587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.2353935Z 2025-05-07T19:59:06.2355712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.2357408Z 2025-05-07T19:59:06.2358851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.2360599Z 2025-05-07T19:59:06.2362144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.2363921Z 2025-05-07T19:59:07.4695963Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:07.4717221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4719134Z 2025-05-07T19:59:07.4720942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4722761Z 2025-05-07T19:59:07.4724392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4726470Z 2025-05-07T19:59:07.4728112Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4729945Z 2025-05-07T19:59:07.4731561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4733511Z 2025-05-07T19:59:07.4735077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.4736739Z 2025-05-07T19:59:07.9573934Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T19:59:07.9596427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.9598361Z 2025-05-07T19:59:07.9599930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:59:07.9601867Z 2025-05-07T19:59:07.9603416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.9605322Z 2025-05-07T19:59:07.9606904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.9608549Z 2025-05-07T19:59:07.9610144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.9611896Z 2025-05-07T19:59:07.9613392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:07.9615171Z 2025-05-07T19:59:10.8873087Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T19:59:10.8896646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.8898796Z 2025-05-07T19:59:10.8900428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.8902250Z 2025-05-07T19:59:10.8903854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.8905631Z 2025-05-07T19:59:10.8907213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.8908793Z 2025-05-07T19:59:10.8910203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.8911803Z 2025-05-07T19:59:10.8913437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.8915134Z 2025-05-07T19:59:11.8007100Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:11.8031527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8033600Z 2025-05-07T19:59:11.8035224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8037117Z 2025-05-07T19:59:11.8038746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8040514Z 2025-05-07T19:59:11.8042139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8044011Z 2025-05-07T19:59:11.8045655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8047505Z 2025-05-07T19:59:11.8049099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:11.8051029Z 2025-05-07T19:59:11.8477617Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:11.8500916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8502777Z 2025-05-07T19:59:11.8504395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8506288Z 2025-05-07T19:59:11.8507924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8509752Z 2025-05-07T19:59:11.8511413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8513431Z 2025-05-07T19:59:11.8515066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8516870Z 2025-05-07T19:59:11.8518528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8520439Z 2025-05-07T19:59:13.6218694Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:13.6242715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.6244719Z 2025-05-07T19:59:13.6246352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.6248255Z 2025-05-07T19:59:13.6249916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.6251776Z 2025-05-07T19:59:13.6253448Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.6255392Z 2025-05-07T19:59:13.6257099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.6258994Z 2025-05-07T19:59:13.6260760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:13.6262670Z 2025-05-07T19:59:15.4014924Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:15.4036460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4038253Z 2025-05-07T19:59:15.4039744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:59:15.4041542Z 2025-05-07T19:59:15.4042813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4044457Z 2025-05-07T19:59:15.4045954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4047643Z 2025-05-07T19:59:15.4049349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4051131Z 2025-05-07T19:59:15.4052626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4054406Z 2025-05-07T19:59:18.3711353Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T19:59:18.3732927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3734419Z 2025-05-07T19:59:18.3735730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3737121Z 2025-05-07T19:59:18.3738446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3740190Z 2025-05-07T19:59:18.3741831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3743787Z 2025-05-07T19:59:18.3745503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3747411Z 2025-05-07T19:59:18.3749133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.3751275Z 2025-05-07T19:59:18.6286313Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:18.6309990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.6311747Z 2025-05-07T19:59:18.6313484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.6315359Z 2025-05-07T19:59:18.6317008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.6318902Z 2025-05-07T19:59:18.6320601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.6322456Z 2025-05-07T19:59:18.6324138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.6326073Z 2025-05-07T19:59:18.6327700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:59:18.6329802Z 2025-05-07T19:59:19.3763414Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:19.3787613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.3789596Z 2025-05-07T19:59:19.3791281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.3793304Z 2025-05-07T19:59:19.3794939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.3796711Z 2025-05-07T19:59:19.3798284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.3800401Z 2025-05-07T19:59:19.3802017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.3803877Z 2025-05-07T19:59:19.3805747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.3807668Z 2025-05-07T19:59:21.4152971Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T19:59:21.4175028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:21.4176952Z 2025-05-07T19:59:21.4178704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:21.4180429Z 2025-05-07T19:59:21.4181894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:21.4184169Z 2025-05-07T19:59:21.4185537Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:21.4187178Z 2025-05-07T19:59:21.4188872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:21.4190767Z 2025-05-07T19:59:21.4192579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:21.4194409Z 2025-05-07T19:59:27.4388759Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T19:59:27.4409915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.4411639Z 2025-05-07T19:59:27.4413187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:59:27.4414903Z 2025-05-07T19:59:27.4416690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.4418457Z 2025-05-07T19:59:27.4420193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.4421877Z 2025-05-07T19:59:27.4423454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.4425175Z 2025-05-07T19:59:27.4426741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.4428474Z 2025-05-07T19:59:32.8559429Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T19:59:32.8581460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.8583326Z 2025-05-07T19:59:32.8584931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.8587325Z 2025-05-07T19:59:32.8589193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.8591052Z 2025-05-07T19:59:32.8592794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.8594616Z 2025-05-07T19:59:32.8596144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.8597953Z 2025-05-07T19:59:32.8599544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:32.8601539Z 2025-05-07T19:59:36.2018676Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:36.2041212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.2043303Z 2025-05-07T19:59:36.2045030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.2047251Z 2025-05-07T19:59:36.2048835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.2050667Z 2025-05-07T19:59:36.2051931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.2053458Z 2025-05-07T19:59:36.2054989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:36.2056674Z 2025-05-07T19:59:36.2058182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:36.2060075Z 2025-05-07T19:59:40.5824255Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr 
-D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:40.5846653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.5849012Z 2025-05-07T19:59:40.5850575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.5852344Z 2025-05-07T19:59:40.5853841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.5855553Z 2025-05-07T19:59:40.5857076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.5859024Z 2025-05-07T19:59:40.5860548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.5862159Z 2025-05-07T19:59:40.5863602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.5865363Z 2025-05-07T19:59:40.8021503Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:40.8044380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.8046027Z 2025-05-07T19:59:40.8047450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.8049287Z 2025-05-07T19:59:40.8050782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:40.8052804Z 2025-05-07T19:59:40.8054422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.8056137Z 2025-05-07T19:59:40.8057668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.8059477Z 2025-05-07T19:59:40.8061041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.8062827Z 2025-05-07T19:59:43.8336375Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T19:59:43.8357485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:43.8359296Z 2025-05-07T19:59:43.8360870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation 
is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:43.8362859Z 2025-05-07T19:59:43.8364437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:43.8366077Z 2025-05-07T19:59:43.8367522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:43.8369266Z 2025-05-07T19:59:43.8370672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:43.8372389Z 2025-05-07T19:59:43.8373917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:43.8375595Z 2025-05-07T19:59:46.3363926Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T19:59:46.3382237Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.3383780Z 2025-05-07T19:59:46.3385122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.3387155Z 2025-05-07T19:59:46.3388461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.3389937Z 2025-05-07T19:59:46.3391273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.3392882Z 2025-05-07T19:59:46.3394153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.3395693Z 2025-05-07T19:59:46.3397301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.3399129Z 2025-05-07T19:59:48.4209417Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T19:59:48.4221902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.4222976Z 2025-05-07T19:59:48.4223860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.4224845Z 2025-05-07T19:59:48.4225724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.4226706Z 2025-05-07T19:59:48.4227583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.4228583Z 2025-05-07T19:59:48.4229457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.4230433Z 2025-05-07T19:59:48.4231326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.4232306Z 2025-05-07T19:59:56.8936185Z [381/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:59:56.8955986Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:59:59.7797613Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T19:59:59.7821338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.7823522Z 2025-05-07T19:59:59.7825137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.7826936Z 2025-05-07T19:59:59.7828791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:59:59.7830712Z 2025-05-07T19:59:59.7832641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.7834641Z 2025-05-07T19:59:59.7836372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.7838363Z 2025-05-07T19:59:59.7840105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.7842201Z 2025-05-07T20:00:05.1226110Z [383/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:00:05.1245315Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:08.0173656Z [384/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:00:08.0193757Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:08.1248737Z [385/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:08.1272298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.1274420Z 2025-05-07T20:00:08.1276083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.1277867Z 2025-05-07T20:00:08.1279533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.1281718Z 2025-05-07T20:00:08.1283452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.1285396Z 2025-05-07T20:00:08.1287356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.1289290Z 2025-05-07T20:00:08.1291002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.3758333Z 2025-05-07T20:00:08.3780356Z [386/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:08.3805078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.3807126Z 2025-05-07T20:00:08.3808918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.3810973Z 2025-05-07T20:00:08.3812505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.3814415Z 2025-05-07T20:00:08.3816177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.3818155Z 2025-05-07T20:00:08.3819875Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.3821832Z 2025-05-07T20:00:08.3823591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.3825566Z 2025-05-07T20:00:09.5667786Z [387/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:00:09.5692236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.5694385Z 2025-05-07T20:00:09.5696167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.5698120Z 2025-05-07T20:00:09.5699801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T20:00:09.5701667Z 2025-05-07T20:00:09.5703386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.5705343Z 2025-05-07T20:00:09.5706895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.5708809Z 2025-05-07T20:00:09.5710476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:09.5712508Z 2025-05-07T20:00:10.1237881Z [388/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:10.1258680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1260445Z 2025-05-07T20:00:10.1261901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1263593Z 2025-05-07T20:00:10.1265177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1266996Z 2025-05-07T20:00:10.1268418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1270059Z 2025-05-07T20:00:10.1271554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1273380Z 2025-05-07T20:00:10.1274866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1276620Z 2025-05-07T20:00:10.7893483Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:10.7917338Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.7919364Z 2025-05-07T20:00:10.7921137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.7923147Z 2025-05-07T20:00:10.7924897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.7926941Z 2025-05-07T20:00:10.7928667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.7930753Z 2025-05-07T20:00:10.7932402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.7933880Z 2025-05-07T20:00:10.7935399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.7937653Z 2025-05-07T20:00:11.0232230Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:11.0256283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.0258164Z 2025-05-07T20:00:11.0259933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.0261864Z 2025-05-07T20:00:11.0263551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.0265365Z 2025-05-07T20:00:11.0266923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.0268788Z 2025-05-07T20:00:11.0270353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.0272236Z 2025-05-07T20:00:11.0274040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.0275775Z 2025-05-07T20:00:11.1356512Z [391/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:00:11.1380176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1382074Z 2025-05-07T20:00:11.1383898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1386202Z 2025-05-07T20:00:11.1387868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1389835Z 2025-05-07T20:00:11.1391516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1393636Z 2025-05-07T20:00:11.1395281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1397135Z 2025-05-07T20:00:11.1398846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1400765Z 2025-05-07T20:00:11.1920242Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:11.1942356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1944302Z 2025-05-07T20:00:11.1946002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1947948Z 2025-05-07T20:00:11.1949648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1951550Z 2025-05-07T20:00:11.1953477Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1955399Z 2025-05-07T20:00:11.1957112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1958996Z 2025-05-07T20:00:11.1960706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.1962642Z 2025-05-07T20:00:11.2908153Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:11.2931261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.2933089Z 2025-05-07T20:00:11.2934711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:00:11.2936545Z 2025-05-07T20:00:11.2938129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.2939948Z 2025-05-07T20:00:11.2941583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.2943390Z 2025-05-07T20:00:11.2944948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.2946580Z 2025-05-07T20:00:11.2947905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.2949895Z 2025-05-07T20:00:12.0549920Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:12.0567505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation 
is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0568841Z 2025-05-07T20:00:12.0570010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0571473Z 2025-05-07T20:00:12.0572629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0573921Z 2025-05-07T20:00:12.0575067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0576325Z 2025-05-07T20:00:12.0577459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0579047Z 2025-05-07T20:00:12.0580266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0581575Z 2025-05-07T20:00:12.6693587Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:12.6723128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.6725721Z 2025-05-07T20:00:12.6727914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.6730386Z 2025-05-07T20:00:12.6732632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.6735000Z 2025-05-07T20:00:12.6737073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.6739839Z 2025-05-07T20:00:12.6741969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.6744428Z 2025-05-07T20:00:12.6746826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.6749287Z 
2025-05-07T20:00:14.9292318Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:14.9315910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.9317935Z 2025-05-07T20:00:14.9319676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.9321670Z 2025-05-07T20:00:14.9323257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.9325295Z 2025-05-07T20:00:14.9326981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.9329202Z 2025-05-07T20:00:14.9331134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:00:14.9332953Z 2025-05-07T20:00:14.9334633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.9336453Z 2025-05-07T20:00:18.2323346Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:00:18.2345635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.2347499Z 2025-05-07T20:00:18.2348999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.2350743Z 2025-05-07T20:00:18.2352288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.2354478Z 
2025-05-07T20:00:18.2356294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.2358134Z 2025-05-07T20:00:18.2359748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.2361525Z 2025-05-07T20:00:18.2363030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:18.2364782Z 2025-05-07T20:00:19.9403043Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:19.9425458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.9427216Z 2025-05-07T20:00:19.9428784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:19.9432616Z 2025-05-07T20:00:19.9434031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.9435735Z 2025-05-07T20:00:19.9437551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.9439317Z 2025-05-07T20:00:19.9440861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.9442587Z 2025-05-07T20:00:19.9444157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.9446023Z 2025-05-07T20:00:21.1643172Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:21.1665901Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1667819Z 2025-05-07T20:00:21.1669405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1670988Z 2025-05-07T20:00:21.1672846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1674571Z 2025-05-07T20:00:21.1676063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1677846Z 2025-05-07T20:00:21.1679395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1681195Z 2025-05-07T20:00:21.1683023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.1684809Z 2025-05-07T20:00:24.6531553Z [400/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:00:24.6550396Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T20:00:24.7420267Z [401/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:24.7440138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7441785Z 2025-05-07T20:00:24.7443264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7444918Z 2025-05-07T20:00:24.7446366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7448060Z 2025-05-07T20:00:24.7449517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7451058Z 2025-05-07T20:00:24.7452385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7453891Z 
2025-05-07T20:00:24.7455234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.7456763Z 2025-05-07T20:00:25.3827223Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:25.3849189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.3851185Z 2025-05-07T20:00:25.3852871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.3854782Z 2025-05-07T20:00:25.3856407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.3858293Z 2025-05-07T20:00:25.3859822Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.3861516Z 2025-05-07T20:00:25.3863058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.3864873Z 2025-05-07T20:00:25.3866498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.3868568Z 2025-05-07T20:00:25.4680703Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:25.4702615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.4704567Z 2025-05-07T20:00:25.4706164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.4707988Z 
2025-05-07T20:00:25.4709462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.4711291Z 2025-05-07T20:00:25.4713034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.4714879Z 2025-05-07T20:00:25.4716463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.4718669Z 2025-05-07T20:00:25.4720364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.4722321Z 2025-05-07T20:00:25.8168184Z [404/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:25.8192685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T20:00:25.8194569Z 2025-05-07T20:00:25.8196250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8198157Z 2025-05-07T20:00:25.8199852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8201754Z 2025-05-07T20:00:25.8203409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8205368Z 2025-05-07T20:00:25.8206998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8208855Z 2025-05-07T20:00:25.8210680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8212443Z 2025-05-07T20:00:25.8703512Z [405/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:25.8727048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8728982Z 2025-05-07T20:00:25.8730559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8732444Z 2025-05-07T20:00:25.8734107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8735948Z 2025-05-07T20:00:25.8737541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8739694Z 2025-05-07T20:00:25.8741559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8743530Z 2025-05-07T20:00:25.8745151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.8747104Z 2025-05-07T20:00:26.5495515Z [406/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:26.5519180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5521158Z 2025-05-07T20:00:26.5522972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5524926Z 2025-05-07T20:00:26.5526649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5528779Z 2025-05-07T20:00:26.5530518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5532481Z 2025-05-07T20:00:26.5534359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5536325Z 2025-05-07T20:00:26.5538061Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5540015Z 2025-05-07T20:00:26.5873559Z [407/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:26.5897005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5898903Z 2025-05-07T20:00:26.5900547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5902598Z 2025-05-07T20:00:26.5904311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5906255Z 2025-05-07T20:00:26.5908156Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5910007Z 2025-05-07T20:00:26.5911638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5913520Z 2025-05-07T20:00:26.5915054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.5916922Z 2025-05-07T20:00:26.6583188Z [408/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:26.6595829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6596824Z 2025-05-07T20:00:26.6597717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6598808Z 
2025-05-07T20:00:26.6599788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6600775Z 2025-05-07T20:00:26.6601668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6602661Z 2025-05-07T20:00:26.6603527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6604499Z 2025-05-07T20:00:26.6605386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6606421Z 2025-05-07T20:00:27.1191745Z [409/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:27.1203915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:27.1205151Z 2025-05-07T20:00:27.1206035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1207025Z 2025-05-07T20:00:27.1208038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1209015Z 2025-05-07T20:00:27.1209904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1210892Z 2025-05-07T20:00:27.1211758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1212824Z 2025-05-07T20:00:27.1213703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.1214687Z 2025-05-07T20:00:27.2207487Z [410/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:27.2219638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2220640Z 2025-05-07T20:00:27.2221670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2222658Z 2025-05-07T20:00:27.2223527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2224513Z 2025-05-07T20:00:27.2225385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2226417Z 2025-05-07T20:00:27.2227284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2228331Z 2025-05-07T20:00:27.2229221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2230204Z 2025-05-07T20:00:27.4796276Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:27.4822955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.4824977Z 2025-05-07T20:00:27.4826607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.4828542Z 2025-05-07T20:00:27.4830143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.4831781Z 2025-05-07T20:00:27.4833517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.4835500Z 2025-05-07T20:00:27.4837065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.4838810Z 2025-05-07T20:00:27.4840380Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.4842268Z 2025-05-07T20:00:28.0846451Z [412/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:28.0868247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.0870110Z 2025-05-07T20:00:28.0871773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.0873717Z 2025-05-07T20:00:28.0875199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.0877219Z 2025-05-07T20:00:28.0878788Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.0880595Z 2025-05-07T20:00:28.0882160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.0883903Z 2025-05-07T20:00:28.0885538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.0888056Z 2025-05-07T20:00:28.6261282Z [413/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:28.6283374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6285214Z 2025-05-07T20:00:28.6287116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:00:28.6289256Z 2025-05-07T20:00:28.6290944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6292754Z 2025-05-07T20:00:28.6294267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6296047Z 2025-05-07T20:00:28.6297589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6299376Z 2025-05-07T20:00:28.6300875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6302694Z 2025-05-07T20:00:28.7111380Z [414/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:28.7133991Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7136210Z 2025-05-07T20:00:28.7137767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7139536Z 2025-05-07T20:00:28.7141085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7142879Z 2025-05-07T20:00:28.7144477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7146295Z 2025-05-07T20:00:28.7147895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7149694Z 2025-05-07T20:00:28.7151320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.7153225Z 2025-05-07T20:00:29.2221969Z [415/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:29.2244876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.2246679Z 2025-05-07T20:00:29.2248338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.2250192Z 2025-05-07T20:00:29.2251844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.2253531Z 2025-05-07T20:00:29.2255180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.2257079Z 2025-05-07T20:00:29.2258728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.2260554Z 2025-05-07T20:00:29.2262205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.2263972Z 2025-05-07T20:00:30.4148966Z [416/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:00:30.4167066Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:32.2219159Z [417/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:32.2241621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.2243678Z 2025-05-07T20:00:32.2245070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.2246857Z 2025-05-07T20:00:32.2248744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.2250567Z 
2025-05-07T20:00:32.2252128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.2253965Z 2025-05-07T20:00:32.2255595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.2257357Z 2025-05-07T20:00:32.2258896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.2260870Z 2025-05-07T20:00:32.4506076Z [418/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:32.4518286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4519525Z 2025-05-07T20:00:32.4520558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:32.4521557Z 2025-05-07T20:00:32.4522419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4523410Z 2025-05-07T20:00:32.4524313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4525310Z 2025-05-07T20:00:32.4526176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4527226Z 2025-05-07T20:00:32.4528121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.4529109Z 2025-05-07T20:00:33.8580075Z [419/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:00:33.8597233Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:35.5606361Z [420/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:00:35.5628361Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:35.6278394Z [421/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:00:35.6297747Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:36.3452936Z [422/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:00:36.3471893Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:36.5917382Z [423/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:36.5936742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5938324Z 2025-05-07T20:00:36.5939734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5941313Z 2025-05-07T20:00:36.5942821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5944367Z 2025-05-07T20:00:36.5945640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5947090Z 2025-05-07T20:00:36.5948351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5949781Z 2025-05-07T20:00:36.5951056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.5952600Z 2025-05-07T20:00:36.6725333Z [424/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:36.6738009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:36.6739024Z 2025-05-07T20:00:36.6739916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.6740916Z 2025-05-07T20:00:36.6741815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.6742801Z 2025-05-07T20:00:36.6743715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.6744704Z 2025-05-07T20:00:36.6745580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.6746600Z 2025-05-07T20:00:36.6747475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:36.6748471Z 2025-05-07T20:00:36.7844830Z [425/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:00:36.7855316Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.0009655Z [426/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:00:37.0027572Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.0693637Z [427/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD 
-MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:00:37.0712573Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.5578477Z [428/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:00:37.5598291Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.6495196Z [429/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:00:37.6513473Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.6641549Z [430/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:00:37.6659329Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.1166025Z [431/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:00:38.1186985Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.2532414Z [432/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:00:38.2553090Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.6711906Z [433/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:00:38.6731688Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.8931868Z [434/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:00:38.8952134Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:39.1132335Z [435/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:00:39.1152065Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:39.1558060Z [436/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:00:39.1575830Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:40.4379800Z [437/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:00:40.4398745Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:40.8645197Z [438/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:00:40.8664197Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:41.0458877Z [439/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:41.0481069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:41.0482998Z 2025-05-07T20:00:41.0484566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.0486670Z 2025-05-07T20:00:41.0488253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.0490025Z 2025-05-07T20:00:41.0491655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.0493454Z 2025-05-07T20:00:41.0495068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.0496802Z 2025-05-07T20:00:41.0498388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.0500237Z 2025-05-07T20:00:41.1814157Z [440/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:00:41.1832171Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:41.1966406Z [441/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:00:41.1985074Z clang-16: warning: argument unused during 
compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:42.0077322Z [442/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:00:42.0098161Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:42.5192269Z [443/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:00:42.5210022Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:43.1754843Z [444/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:00:43.1774864Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:43.2727886Z [445/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:00:43.2751625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.2753573Z 
2025-05-07T20:00:43.2755286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.2757151Z 2025-05-07T20:00:43.2758525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:43.2760126Z 2025-05-07T20:00:43.2761812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.2763757Z 2025-05-07T20:00:43.2765439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.2767420Z 2025-05-07T20:00:43.2768830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:43.2770472Z 2025-05-07T20:00:43.2772145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.2774072Z 2025-05-07T20:00:43.2775808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.2777745Z 
2025-05-07T20:00:43.4052249Z [446/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:00:43.4072861Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:43.8679066Z [447/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:43.8703293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.8705226Z 2025-05-07T20:00:43.8706946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.8708736Z 2025-05-07T20:00:43.8710334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.8712620Z 2025-05-07T20:00:43.8714269Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.8716077Z 2025-05-07T20:00:43.8717632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.8719502Z 2025-05-07T20:00:43.8721215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:43.8723134Z 2025-05-07T20:00:43.9471429Z [448/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:00:43.9492316Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.0117184Z [449/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:00:44.0137184Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.5130766Z [450/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:00:44.5148816Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.8560476Z [451/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:00:44.8581312Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.8787783Z [452/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:00:44.8807782Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:45.1761540Z [453/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:00:45.1781726Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:45.1922570Z [454/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:00:45.4406769Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:45.4426885Z [455/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 
2025-05-07T20:00:45.4447389Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:45.6921274Z [456/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o 
-MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:00:45.6940136Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:45.8516279Z [457/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:00:45.8536869Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:46.0955178Z [458/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:00:46.0976084Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:47.7928154Z [459/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:00:47.7950186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.7952084Z 2025-05-07T20:00:47.7953910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.7955842Z 2025-05-07T20:00:47.7957225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:47.7958849Z 2025-05-07T20:00:47.7960519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.7962446Z 2025-05-07T20:00:47.7964164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.7966107Z 2025-05-07T20:00:47.7967500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:47.7969132Z 2025-05-07T20:00:47.7971036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:00:47.7972914Z 2025-05-07T20:00:47.7974767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.7976537Z 2025-05-07T20:00:49.0658920Z [460/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:00:49.4494272Z [461/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:00:49.4511902Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:50.5971991Z [462/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:00:50.5992683Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:51.1616809Z [463/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:00:51.1633026Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:51.2301765Z [464/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:00:51.2318575Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:51.9045316Z [465/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:00:51.9063504Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:53.4759327Z [466/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:00:53.4776405Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:53.6540489Z [467/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:00:53.6558397Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:53.8627804Z [468/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:00:53.8643129Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:53.9585376Z [469/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:00:53.9602863Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:54.3175277Z [470/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:00:54.6790891Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:54.6840739Z [471/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so && : 2025-05-07T20:00:55.2727106Z [472/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:00:55.3017378Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.3033556Z [473/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:00:55.3051479Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.4769497Z [474/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:00:55.4779182Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.4920792Z [475/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD 
-MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:00:55.4930881Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.9303032Z [476/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:00:55.9321213Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.9622590Z [477/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:00:55.9640138Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:56.0078364Z [478/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:00:56.0094119Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:56.6038714Z [479/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:56.6063546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:56.6065474Z 2025-05-07T20:00:56.6067260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:56.6069224Z 2025-05-07T20:00:56.6070949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:56.6072963Z 2025-05-07T20:00:56.6074696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:56.6076624Z 2025-05-07T20:00:56.6078327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:56.6080368Z 2025-05-07T20:00:56.6082094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:56.6084048Z 2025-05-07T20:00:57.1415502Z [480/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:00:57.6029542Z clang-16: warning: argument unused during 
compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.6045727Z [481/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:00:57.6062208Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:58.1560053Z [482/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:00:58.1573729Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:59.0615754Z [483/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:00:59.0634064Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:59.3823247Z [484/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:00:59.3839623Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:00.0563633Z [485/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:01:00.0582448Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:01.9315801Z [486/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:01.9336367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:01.9337945Z 2025-05-07T20:01:01.9339396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.9341131Z 2025-05-07T20:01:01.9342567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.9344236Z 2025-05-07T20:01:01.9345729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.9347417Z 2025-05-07T20:01:01.9348885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.9350814Z 2025-05-07T20:01:01.9352497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.9354261Z 2025-05-07T20:01:02.8753349Z [487/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:02.8774257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8776003Z 2025-05-07T20:01:02.8777348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8779208Z 2025-05-07T20:01:02.8780669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8782295Z 2025-05-07T20:01:02.8783827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8786339Z 2025-05-07T20:01:02.8787773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8789495Z 2025-05-07T20:01:02.8791248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8793025Z 2025-05-07T20:01:02.8941471Z [488/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:02.8962379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8964130Z 2025-05-07T20:01:02.8965663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8967359Z 2025-05-07T20:01:02.8968836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8970723Z 2025-05-07T20:01:02.8972227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8973977Z 2025-05-07T20:01:02.8975853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8977482Z 2025-05-07T20:01:02.8978933Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.8980581Z 2025-05-07T20:01:03.0349017Z [489/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:01:03.0371715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.0373741Z 2025-05-07T20:01:03.0375248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.0377254Z 2025-05-07T20:01:03.0378822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:03.0380536Z 2025-05-07T20:01:03.0382369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.0384115Z 2025-05-07T20:01:03.0385630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.0387579Z 2025-05-07T20:01:03.0389093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:03.0390708Z 2025-05-07T20:01:03.3751823Z [490/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:01:03.3769659Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:03.5800311Z [491/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:01:03.5818319Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:04.4901079Z [492/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:01:04.4924475Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:04.7424210Z [493/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:01:04.7447366Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:05.2517692Z [494/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:01:05.2535324Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:05.4494367Z [495/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:01:05.4513016Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:06.2384033Z [496/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 
2025-05-07T20:01:06.2398214Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:06.2591775Z [497/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:01:06.2605806Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:07.8022164Z [498/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:01:07.8040743Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:08.4054730Z [499/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 
-DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:01:08.4072920Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:08.7215482Z [500/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:01:08.7232476Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:09.3673085Z [501/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:09.3694972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.3696792Z 2025-05-07T20:01:09.3698320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.3700073Z 2025-05-07T20:01:09.3701592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.3703335Z 2025-05-07T20:01:09.3704918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.3706677Z 2025-05-07T20:01:09.3708168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.3710157Z 2025-05-07T20:01:09.3711971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:09.3713921Z 2025-05-07T20:01:09.4477706Z [502/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:01:09.4495306Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:10.7807988Z [503/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:01:10.7825556Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:10.9313300Z [504/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:01:10.9330470Z clang-16: warning: argument 
unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:12.9690891Z [505/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:01:12.9706932Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:13.8444226Z [506/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:13.8465503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:13.8467120Z 2025-05-07T20:01:13.8468566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:13.8470637Z 2025-05-07T20:01:13.8472542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:13.8474289Z 2025-05-07T20:01:13.8475790Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:13.8477490Z 2025-05-07T20:01:13.8478910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:13.8480562Z 2025-05-07T20:01:13.8482037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:13.8483931Z 2025-05-07T20:01:15.8593346Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:15.8614945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.8617278Z 2025-05-07T20:01:15.8618972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:15.8620888Z 2025-05-07T20:01:15.8622928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.8624748Z 2025-05-07T20:01:15.8626395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.8628085Z 2025-05-07T20:01:15.8629651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.8631623Z 2025-05-07T20:01:15.8633301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.8635239Z 2025-05-07T20:01:16.0906823Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:16.0919236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.0920235Z 2025-05-07T20:01:16.0921317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.0922314Z 2025-05-07T20:01:16.0923177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.0924167Z 2025-05-07T20:01:16.0925038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.0926028Z 2025-05-07T20:01:16.0926904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.0927959Z 2025-05-07T20:01:16.0928835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.0929833Z 2025-05-07T20:01:17.5093660Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu 
-o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:01:17.5115612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.5117632Z 2025-05-07T20:01:17.5119357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.5121315Z 2025-05-07T20:01:17.5123028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.5124866Z 2025-05-07T20:01:17.5126123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.5127841Z 2025-05-07T20:01:17.5129337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.5130885Z 2025-05-07T20:01:17.5132528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.5134359Z 2025-05-07T20:01:17.8040479Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:17.8062543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.8064315Z 2025-05-07T20:01:17.8065797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.8067453Z 2025-05-07T20:01:17.8068856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.8070752Z 2025-05-07T20:01:17.8072552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.8074315Z 2025-05-07T20:01:17.8075854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.8077648Z 2025-05-07T20:01:17.8079158Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.8080954Z 2025-05-07T20:01:18.1184052Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:18.1207164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.1208792Z 2025-05-07T20:01:18.1210238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.1211989Z 2025-05-07T20:01:18.1213670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.1215785Z 2025-05-07T20:01:18.1217501Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.1219428Z 2025-05-07T20:01:18.1221110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.1223024Z 2025-05-07T20:01:18.1224728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.1226693Z 2025-05-07T20:01:18.5319383Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:01:18.5340525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5342490Z 2025-05-07T20:01:18.5344138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5346296Z 2025-05-07T20:01:18.5347869Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5349768Z 2025-05-07T20:01:18.5351372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5353417Z 2025-05-07T20:01:18.5355051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5356919Z 2025-05-07T20:01:18.5358591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.5360458Z 2025-05-07T20:01:18.9020633Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:01:18.9042806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.9044653Z 
2025-05-07T20:01:18.9046350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.9048060Z 2025-05-07T20:01:18.9049549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.9051252Z 2025-05-07T20:01:18.9052670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.9054269Z 2025-05-07T20:01:18.9055797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.9057749Z 2025-05-07T20:01:18.9059452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:18.9061095Z 2025-05-07T20:01:19.8771513Z [514/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:19.8794635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.8796720Z 2025-05-07T20:01:19.8798273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.8800017Z 2025-05-07T20:01:19.8801576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.8803388Z 2025-05-07T20:01:19.8804998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.8806866Z 2025-05-07T20:01:19.8808480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.8810272Z 2025-05-07T20:01:19.8811856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.8813583Z 2025-05-07T20:01:20.3961277Z [515/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:01:20.3973219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.3974224Z 2025-05-07T20:01:20.3975098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.3976083Z 2025-05-07T20:01:20.3976960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.3977935Z 2025-05-07T20:01:20.3978803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.3979808Z 2025-05-07T20:01:20.3980672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.3981640Z 2025-05-07T20:01:20.3982520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.3983501Z 2025-05-07T20:01:21.3466839Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:01:21.3489657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3491683Z 2025-05-07T20:01:21.3493278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3495095Z 2025-05-07T20:01:21.3496777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3498662Z 2025-05-07T20:01:21.3500358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3502253Z 2025-05-07T20:01:21.3503961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3505884Z 2025-05-07T20:01:21.3507570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3509474Z 2025-05-07T20:01:23.7053795Z [517/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp 
-MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:01:23.7070322Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:27.9320794Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:01:29.1346819Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:01:29.1369185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.1371076Z 2025-05-07T20:01:29.1372766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.1374574Z 2025-05-07T20:01:29.1376173Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.1378025Z 2025-05-07T20:01:29.1379658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.1381595Z 2025-05-07T20:01:29.1383184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.1385122Z 2025-05-07T20:01:29.1387054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:29.1388914Z 2025-05-07T20:01:29.8608475Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:01:35.3574560Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:01:36.7998807Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:01:36.8021403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.8023331Z 2025-05-07T20:01:36.8024996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.8026928Z 2025-05-07T20:01:36.8028628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.8030494Z 2025-05-07T20:01:36.8032168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.8034227Z 2025-05-07T20:01:36.8035844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.8038053Z 2025-05-07T20:01:36.8039798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:36.8041757Z 2025-05-07T20:01:43.6390285Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:01:46.3281773Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:01:46.7482712Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:01:46.7505291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.7507277Z 2025-05-07T20:01:46.7508971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.7511167Z 2025-05-07T20:01:46.7512975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.7514884Z 2025-05-07T20:01:46.7516830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.7518763Z 2025-05-07T20:01:46.7520464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.7522322Z 2025-05-07T20:01:46.7523773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.7525410Z 2025-05-07T20:01:47.9791199Z [526/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:01:47.9807027Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:49.2001756Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:01:49.2024203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:49.2026070Z 2025-05-07T20:01:49.2027714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:49.2029554Z 2025-05-07T20:01:49.2031174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:49.2033223Z 2025-05-07T20:01:49.2034934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:49.2036807Z 2025-05-07T20:01:49.2038412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:49.2040213Z 2025-05-07T20:01:49.2041881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:49.2043791Z 2025-05-07T20:01:49.7736451Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:01:49.9176656Z [529/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:01:49.9810210Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 
2025-05-07T20:01:51.1406531Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:01:53.3220277Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:01:53.5639917Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:01:54.1663636Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:01:55.1317188Z [535/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:01:55.1464139Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:01:55.1466256Z ################################################################################ 2025-05-07T20:01:55.1466876Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.1467691Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:01:55.1468573Z Removing all RPATHs ... 2025-05-07T20:01:55.1469071Z ################################################################################ 2025-05-07T20:01:55.2193776Z [537/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:01:55.2230940Z [538/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 1 2025-05-07T20:01:55.2233194Z ################################################################################ 2025-05-07T20:01:55.2233772Z [CMAKE] Running post-build script ... 
2025-05-07T20:01:55.2234662Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:01:55.2235591Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:55.2236244Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:55.2236972Z ################################################################################ 2025-05-07T20:01:55.3175029Z [539/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:01:55.3306149Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:01:55.3308730Z ################################################################################ 2025-05-07T20:01:55.3309409Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.3310480Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:01:55.3311545Z Removing all RPATHs ... 
2025-05-07T20:01:55.3312031Z ################################################################################ 2025-05-07T20:01:55.3559849Z [541/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:01:55.3581893Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:01:55.3584197Z ################################################################################ 2025-05-07T20:01:55.3585000Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.3586277Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:01:55.3587236Z Removing all RPATHs ... 2025-05-07T20:01:55.3587680Z ################################################################################ 2025-05-07T20:01:55.3847029Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:01:55.3849405Z ################################################################################ 2025-05-07T20:01:55.3850040Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.3851086Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:01:55.3855056Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:55.3855760Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:55.3856498Z ################################################################################ 2025-05-07T20:01:55.3858749Z [544/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:01:55.3861027Z ################################################################################ 2025-05-07T20:01:55.3861639Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.3862788Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:01:55.3863949Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:55.3864657Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:55.3865428Z ################################################################################ 2025-05-07T20:01:55.4923112Z [545/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:01:55.4925112Z ################################################################################ 2025-05-07T20:01:55.4925637Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.4926534Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:01:55.4927464Z Removing all RPATHs ... 
2025-05-07T20:01:55.4927864Z ################################################################################ 2025-05-07T20:01:55.4929690Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:01:55.4931573Z ################################################################################ 2025-05-07T20:01:55.4932117Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.4933032Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:01:55.4933953Z Removing all RPATHs ... 2025-05-07T20:01:55.4934397Z ################################################################################ 2025-05-07T20:01:55.6489556Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:01:55.6492133Z ################################################################################ 2025-05-07T20:01:55.6492776Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.6494251Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:01:55.6495542Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:55.6496253Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:55.6497171Z ################################################################################ 2025-05-07T20:01:55.8003264Z [548/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:01:55.8005988Z ################################################################################ 2025-05-07T20:01:55.8006890Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.8008155Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:01:55.8009454Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:55.8010182Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:55.8010994Z ################################################################################ 2025-05-07T20:01:56.2305637Z [549/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:01:56.2308595Z ################################################################################ 2025-05-07T20:01:56.2309368Z [CMAKE] Running post-build script ... 2025-05-07T20:01:56.2310723Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:01:56.2312138Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:56.2313090Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:56.2313989Z ################################################################################ 2025-05-07T20:01:56.4960970Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:01:57.1890966Z [551/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:01:57.9163917Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:01:57.9268114Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:01:57.9288457Z In file included from tmpxft_000097ca_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:1: 2025-05-07T20:01:57.9291159Z 
/tmp/tmpxft_000097ca_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:45:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.9301774Z static void __device_stub__ZN10fbgemm_gpu28unique_indices_length_kernelIlLl9223372036854775807ELln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_S5_S5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArg(__par2, 32UL);__cudaSetupArg(__par3, 48UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::unique_indices_length_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:01:57.9311806Z ^ 2025-05-07T20:01:57.9314315Z /tmp/tmpxft_000097ca_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:45:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.9317329Z /tmp/tmpxft_000097ca_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:45:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.9320336Z /tmp/tmpxft_000097ca_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:51:860: warning: integer literal is too large to be represented in a signed 
integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.9329475Z static void __device_stub__ZN10fbgemm_gpu24compute_hash_size_kernelIlLln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_lS5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const int64_t __par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArgSimple(__par2, 32UL);__cudaSetupArg(__par3, 40UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const int64_t, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::compute_hash_size_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:01:57.9337810Z ^ 2025-05-07T20:01:57.9340146Z /tmp/tmpxft_000097ca_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:51:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.9343111Z /tmp/tmpxft_000097ca_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:51:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.9346264Z /tmp/tmpxft_000097ca_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:54:445: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.9349354Z /tmp/tmpxft_000097ca_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:54:1476: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned 
[-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.9351111Z 8 warnings generated. 2025-05-07T20:01:57.9640443Z [554/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:01:58.3334809Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 
-Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:01:59.0395253Z [556/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:01:59.0397020Z ################################################################################ 2025-05-07T20:01:59.0397521Z [CMAKE] Running post-build script ... 2025-05-07T20:01:59.0398316Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:01:59.0399113Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:59.0399644Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:59.0400210Z ################################################################################ 2025-05-07T20:01:59.2876728Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:01:59.6078272Z [558/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:01:59.8747321Z [559/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:01:59.8750355Z ################################################################################ 2025-05-07T20:01:59.8751098Z [CMAKE] Running post-build script ... 2025-05-07T20:01:59.8752590Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:01:59.8754200Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:01:59.8755044Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:59.8755947Z ################################################################################ 2025-05-07T20:02:00.1658451Z [560/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 
-DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:02:01.9590878Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:02:02.7364442Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:02:02.7375446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7376072Z 2025-05-07T20:02:02.7376666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for 
"__global__" function 2025-05-07T20:02:02.7377280Z 2025-05-07T20:02:02.7377789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7378419Z 2025-05-07T20:02:02.7378989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7379611Z 2025-05-07T20:02:02.7380124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7380732Z 2025-05-07T20:02:02.7381231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7381914Z 2025-05-07T20:02:02.7382419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7383036Z 2025-05-07T20:02:02.7383578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7384194Z 2025-05-07T20:02:02.7384697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7385317Z 2025-05-07T20:02:02.7386052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7386668Z 2025-05-07T20:02:02.7387183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7387799Z 2025-05-07T20:02:02.7388299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning 
#20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.7388927Z 2025-05-07T20:02:03.1176344Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:02:03.8923281Z [564/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:02:03.8944233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8945503Z 2025-05-07T20:02:03.8946527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8947752Z 2025-05-07T20:02:03.8948789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8950083Z 2025-05-07T20:02:03.8951137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8952672Z 2025-05-07T20:02:03.8953752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8954999Z 2025-05-07T20:02:03.8956018Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8957294Z 2025-05-07T20:02:03.8958438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8959668Z 2025-05-07T20:02:03.8960719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8961952Z 2025-05-07T20:02:03.8963086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8964348Z 2025-05-07T20:02:03.8965353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8966588Z 2025-05-07T20:02:03.8967688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8968844Z 2025-05-07T20:02:03.8969850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8971091Z 2025-05-07T20:02:03.8972094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8973320Z 2025-05-07T20:02:03.8974358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8975628Z 2025-05-07T20:02:03.8976630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: 
inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.8977896Z 2025-05-07T20:02:04.1270425Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:02:04.1292106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:04.1293388Z 2025-05-07T20:02:04.1294567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:04.1295795Z 2025-05-07T20:02:04.1296863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:04.1298052Z 2025-05-07T20:02:04.1299091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:04.1300478Z 2025-05-07T20:02:04.1301474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:04.1302737Z 2025-05-07T20:02:04.1303730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:04.1304965Z 2025-05-07T20:02:05.4398599Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:02:05.4409908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4410607Z 2025-05-07T20:02:05.4411226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4411885Z 2025-05-07T20:02:05.4412449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4413105Z 2025-05-07T20:02:05.4413706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4414388Z 2025-05-07T20:02:05.4414929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4415585Z 2025-05-07T20:02:05.4416144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4416863Z 2025-05-07T20:02:05.4417404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4418079Z 2025-05-07T20:02:05.4418623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4419282Z 2025-05-07T20:02:05.4419845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored 
for "__global__" function 2025-05-07T20:02:05.4420505Z 2025-05-07T20:02:05.4421050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4421725Z 2025-05-07T20:02:05.4422259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4422906Z 2025-05-07T20:02:05.4423468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4424116Z 2025-05-07T20:02:05.4424657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4425336Z 2025-05-07T20:02:05.4425869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4426516Z 2025-05-07T20:02:05.4427060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:05.4427705Z 2025-05-07T20:02:07.1707700Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:02:07.1718641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:07.1719240Z 2025-05-07T20:02:07.1719733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored 
for "__global__" function 2025-05-07T20:02:07.1720321Z 2025-05-07T20:02:07.1720793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:07.1721391Z 2025-05-07T20:02:07.5741768Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:02:11.2963872Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:02:11.2984097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.2986073Z 2025-05-07T20:02:11.2987997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.2989799Z 2025-05-07T20:02:11.2991390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.2993305Z 2025-05-07T20:02:11.2994942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.2996751Z 2025-05-07T20:02:11.2998343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3000426Z 2025-05-07T20:02:11.3002010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.3003818Z 2025-05-07T20:02:11.7353214Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:02:11.7373901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.7375715Z 2025-05-07T20:02:11.7377306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:02:11.7378887Z 2025-05-07T20:02:11.7379866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:02:11.7381075Z 2025-05-07T20:02:11.7382711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.7384473Z 2025-05-07T20:02:11.7386324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.7388350Z 2025-05-07T20:02:11.7389374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:02:11.7390615Z 2025-05-07T20:02:11.7392440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.7394257Z 2025-05-07T20:02:11.7395785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.7397478Z 2025-05-07T20:02:14.5538602Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:02:14.9376520Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:02:14.9396587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9398451Z 2025-05-07T20:02:14.9400053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9401823Z 2025-05-07T20:02:14.9403330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9405090Z 2025-05-07T20:02:14.9406670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9408449Z 2025-05-07T20:02:14.9410013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9411724Z 2025-05-07T20:02:14.9413247Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.9415015Z 2025-05-07T20:02:15.2415895Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:02:15.2434755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.2436565Z 2025-05-07T20:02:15.2437991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.2439471Z 2025-05-07T20:02:15.2441007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.2442622Z 2025-05-07T20:02:15.2444071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:02:15.2445765Z 2025-05-07T20:02:15.2447313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.2448896Z 2025-05-07T20:02:15.2450341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.2451975Z 2025-05-07T20:02:16.5184905Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:02:17.1235569Z [575/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:02:17.1321709Z [576/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:17.1323579Z ################################################################################ 2025-05-07T20:02:17.1324119Z [CMAKE] Running post-build script ... 2025-05-07T20:02:17.1324988Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:17.1325906Z Removing all RPATHs ... 2025-05-07T20:02:17.1326422Z ################################################################################ 2025-05-07T20:02:17.5376663Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:02:17.5388798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5389804Z 2025-05-07T20:02:17.5390679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5391683Z 2025-05-07T20:02:17.5392645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5393744Z 2025-05-07T20:02:17.5394633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5395619Z 2025-05-07T20:02:17.5396557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5397532Z 2025-05-07T20:02:17.5398461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.5399458Z 2025-05-07T20:02:18.2857547Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:02:18.2868969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.2869969Z 2025-05-07T20:02:18.2870844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.2871825Z 2025-05-07T20:02:18.2872819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.2873906Z 
2025-05-07T20:02:18.2874783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.2875783Z 2025-05-07T20:02:18.2876709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.2877681Z 2025-05-07T20:02:18.2878621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:18.2879598Z 2025-05-07T20:02:20.4055021Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:02:20.4066671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.4067680Z 2025-05-07T20:02:20.4068554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.4069538Z 2025-05-07T20:02:20.4070420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation 
is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.4071480Z 2025-05-07T20:02:20.4072452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.4073442Z 2025-05-07T20:02:20.4074376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.4075361Z 2025-05-07T20:02:20.4076308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.4077290Z 2025-05-07T20:02:21.3707430Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:02:23.3423454Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:02:23.3434721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.3435727Z 2025-05-07T20:02:23.3436606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.3437594Z 2025-05-07T20:02:23.3438492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.3439473Z 2025-05-07T20:02:23.3440360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.3441361Z 2025-05-07T20:02:23.3442225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.3443213Z 2025-05-07T20:02:23.3444082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.3445067Z 2025-05-07T20:02:25.0036929Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 
2025-05-07T20:02:25.0048202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.0049198Z 2025-05-07T20:02:25.0050099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.0051089Z 2025-05-07T20:02:25.0051952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.0052944Z 2025-05-07T20:02:25.0053819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.0054815Z 2025-05-07T20:02:25.0055689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.0056664Z 2025-05-07T20:02:25.0057545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:25.0058523Z 2025-05-07T20:02:30.6662702Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:02:30.6674146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.6675142Z 2025-05-07T20:02:30.6676034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.6677026Z 2025-05-07T20:02:30.6677887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.6678878Z 2025-05-07T20:02:30.6679761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.6680750Z 2025-05-07T20:02:30.6681616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.6682592Z 2025-05-07T20:02:30.6683477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.6684456Z 2025-05-07T20:02:34.6538637Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:02:34.6550103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.6551095Z 2025-05-07T20:02:34.6551967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.6553077Z 2025-05-07T20:02:34.6553950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.6554943Z 2025-05-07T20:02:34.6555819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.6556808Z 2025-05-07T20:02:34.6557686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:34.6558655Z 2025-05-07T20:02:34.6559525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:02:34.6560516Z 2025-05-07T20:02:36.4064500Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:02:36.4075862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.4076876Z 2025-05-07T20:02:36.4077758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.4078779Z 2025-05-07T20:02:36.4079644Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.4080619Z 2025-05-07T20:02:36.4081510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.4082495Z 2025-05-07T20:02:36.4083361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.4084349Z 2025-05-07T20:02:36.4085219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.4086391Z 2025-05-07T20:02:36.6167790Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:02:36.6179097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.6180104Z 2025-05-07T20:02:36.6180981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.6181971Z 2025-05-07T20:02:36.6182849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.6183832Z 2025-05-07T20:02:36.6184705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.6185908Z 2025-05-07T20:02:36.6186812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.6187801Z 
2025-05-07T20:02:36.6188674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.6189658Z 2025-05-07T20:02:37.1951090Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:02:37.1962499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1963504Z 2025-05-07T20:02:37.1964401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1965395Z 2025-05-07T20:02:37.1966269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1967241Z 2025-05-07T20:02:37.1968117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:02:37.1969116Z 2025-05-07T20:02:37.1969980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1970960Z 2025-05-07T20:02:37.1971829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1972811Z 2025-05-07T20:02:37.3604355Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include 
-DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:02:37.3615519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.3616507Z 2025-05-07T20:02:37.3617379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.3618388Z 2025-05-07T20:02:37.3619250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.3620225Z 2025-05-07T20:02:37.3621129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.3622109Z 2025-05-07T20:02:37.3622988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.3623963Z 2025-05-07T20:02:37.3624842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.3625837Z 2025-05-07T20:02:38.5071035Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:02:38.5082277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.5083266Z 2025-05-07T20:02:38.5084157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.5085143Z 2025-05-07T20:02:38.5086301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.5087392Z 2025-05-07T20:02:38.5088298Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.5089276Z 2025-05-07T20:02:38.5090158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.5091133Z 2025-05-07T20:02:38.5092020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.5093002Z 2025-05-07T20:02:40.0457374Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:02:40.0468678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0469667Z 2025-05-07T20:02:40.0470560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0471559Z 2025-05-07T20:02:40.0472521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0473526Z 2025-05-07T20:02:40.0474404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0475401Z 2025-05-07T20:02:40.0476269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0477247Z 2025-05-07T20:02:40.0478135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0479127Z 2025-05-07T20:02:40.3230972Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:02:40.3242291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3243303Z 2025-05-07T20:02:40.3244185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3245265Z 2025-05-07T20:02:40.3246117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3247068Z 2025-05-07T20:02:40.3247919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3248887Z 2025-05-07T20:02:40.3249728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3250684Z 2025-05-07T20:02:40.3251534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3252498Z 2025-05-07T20:02:41.8230515Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:02:41.8249964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8251605Z 2025-05-07T20:02:41.8253085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:41.8254828Z 2025-05-07T20:02:41.8256280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8257999Z 2025-05-07T20:02:41.8259430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8261073Z 2025-05-07T20:02:41.8262542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8264210Z 2025-05-07T20:02:41.8265711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8267387Z 2025-05-07T20:02:42.2631885Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:02:42.2651509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.2653228Z 2025-05-07T20:02:42.2654751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.2656468Z 2025-05-07T20:02:42.2657967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.2659613Z 2025-05-07T20:02:42.2661115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.2662740Z 2025-05-07T20:02:42.2664195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.2665849Z 2025-05-07T20:02:42.2667291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.2668991Z 2025-05-07T20:02:44.9310945Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:44.9333017Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.9334682Z 2025-05-07T20:02:44.9336196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.9337877Z 2025-05-07T20:02:44.9339330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.9341020Z 2025-05-07T20:02:44.9342496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.9344152Z 2025-05-07T20:02:44.9345600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.9347297Z 2025-05-07T20:02:44.9348743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.9350414Z 2025-05-07T20:02:45.2869280Z [595/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:45.2888068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2889559Z 2025-05-07T20:02:45.2890885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2892363Z 2025-05-07T20:02:45.2893658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2895110Z 2025-05-07T20:02:45.2896428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2898102Z 2025-05-07T20:02:45.2899274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2900611Z 2025-05-07T20:02:45.2901967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.2903565Z 2025-05-07T20:02:51.0599083Z [596/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:02:51.8061723Z [597/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:02:51.9091308Z [598/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:02:52.4322967Z ################################################################################ 2025-05-07T20:02:52.4324124Z [CMAKE] Running post-build script ... 2025-05-07T20:02:52.4325814Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:02:52.4326381Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:52.4326776Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:52.4327193Z ################################################################################ 2025-05-07T20:02:52.4338323Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:52.4349686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4350600Z 2025-05-07T20:02:52.4351429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4352427Z 2025-05-07T20:02:52.4353440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4354435Z 2025-05-07T20:02:52.4355316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4356309Z 2025-05-07T20:02:52.4357176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4358207Z 2025-05-07T20:02:52.4359177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4360086Z 2025-05-07T20:02:53.8033659Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined 
-Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:02:54.4658920Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:02:54.4790856Z [602/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build 
-L"/github/home/miniconda/envs/build_binary/lib/stubs" -L"/github/home/miniconda/envs/build_binary/lib" && : 2025-05-07T20:02:54.5629969Z [603/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed 
/github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && : 2025-05-07T20:02:54.5796699Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:02:54.5798056Z ################################################################################ 2025-05-07T20:02:54.5798407Z [CMAKE] Running post-build script ... 2025-05-07T20:02:54.5799069Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:02:54.5799908Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:54.5800294Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:54.5800707Z ################################################################################ 2025-05-07T20:02:56.4736250Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:02:56.4737614Z ################################################################################ 2025-05-07T20:02:56.4737994Z [CMAKE] Running post-build script ... 2025-05-07T20:02:56.4738737Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:02:56.4739408Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:02:56.4739809Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:56.4740361Z ################################################################################ 2025-05-07T20:02:57.8286586Z [606/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:02:57.8288194Z ################################################################################ 2025-05-07T20:02:57.8288582Z [CMAKE] Running post-build script ... 2025-05-07T20:02:57.8289337Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:02:57.8290082Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:57.8290466Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:57.8290863Z ################################################################################ 2025-05-07T20:02:59.6806291Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:02:59.6810141Z ################################################################################ 2025-05-07T20:02:59.6810970Z [CMAKE] Running post-build script ... 2025-05-07T20:02:59.6811559Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:02:59.6812171Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:02:59.6812531Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:59.6812943Z ################################################################################ 2025-05-07T20:02:59.6813895Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.11/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:02:59.6851929Z -- Install configuration: "Release" 2025-05-07T20:02:59.6853756Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:02:59.6876091Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:02:59.6878810Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:02:59.6897646Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:02:59.6900223Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:02:59.6924035Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:02:59.6947288Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:02:59.6948511Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:02:59.6949620Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:02:59.6962626Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 
2025-05-07T20:02:59.6963938Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:02:59.6965104Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:02:59.6966201Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:02:59.6967282Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:02:59.6968407Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:02:59.6969626Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:02:59.6970710Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:02:59.6971865Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:02:59.6973171Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:02:59.6974425Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:02:59.6975663Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:02:59.6976868Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:02:59.6978108Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:02:59.6979442Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:02:59.6980796Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:02:59.6982091Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:02:59.6983435Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:02:59.6984812Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:02:59.6986346Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:02:59.6987570Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:02:59.6988879Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:02:59.6990292Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:02:59.6991616Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:02:59.6992838Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:59.7008913Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:02:59.7053392Z 2025-05-07T20:02:59.7096377Z 2025-05-07T20:02:59.7096845Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:02:59.7098436Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:02:59.7101506Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:02:59.7102237Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:02:59.7103214Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:02:59.7104804Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:02:59.7105865Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:02:59.7106735Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:02:59.7107587Z copying fbgemm_gpu/runtime_monitor.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:02:59.7108425Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:02:59.7109328Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:02:59.7110428Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:02:59.7113217Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:02:59.7114263Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:02:59.7115337Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:02:59.7116593Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:02:59.7117919Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:02:59.7119278Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:02:59.7120678Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:02:59.7122137Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:02:59.7123253Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:02:59.7124133Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:02:59.7124780Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config 2025-05-07T20:02:59.7125522Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:02:59.7126516Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:02:59.7127285Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs 2025-05-07T20:02:59.7128002Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:02:59.7128807Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:02:59.7129640Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:02:59.7130606Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:02:59.7131673Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:02:59.7132838Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:02:59.7133882Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:02:59.7134784Z copying fbgemm_gpu/docs/sparse_ops.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:02:59.7135622Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:02:59.7136364Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:02:59.7137136Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:02:59.7138065Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:02:59.7138868Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll 2025-05-07T20:02:59.7139557Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:02:59.7140264Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:02:59.7140967Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:02:59.7141662Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton 2025-05-07T20:02:59.7142406Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:02:59.7143267Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:02:59.7144160Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:02:59.7145113Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:02:59.7145891Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils 2025-05-07T20:02:59.7146627Z copying fbgemm_gpu/utils/__init__.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:02:59.7147517Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:02:59.7148385Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:02:59.7149279Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:02:59.7150120Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:02:59.7150871Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:02:59.7151747Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:02:59.7152624Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:02:59.7153389Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:02:59.7154268Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:02:59.7155051Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7156263Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:02:59.7157169Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:02:59.7158320Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:02:59.7159845Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:02:59.7161062Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:02:59.7162245Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:02:59.7163575Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:02:59.7165073Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:02:59.7166580Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:02:59.7167978Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:02:59.7169454Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:02:59.7170806Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:02:59.7172123Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:02:59.7173206Z creating directory 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7173989Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:02:59.7174972Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:02:59.7175948Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:02:59.7176870Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:02:59.7177991Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:02:59.7179167Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:02:59.7180200Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:02:59.7181198Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:02:59.7182304Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:02:59.7183520Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:02:59.7184591Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:02:59.7185342Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:02:59.7186287Z copying fbgemm_gpu/tbe/cache/__init__.py 
-> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:02:59.7187363Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:02:59.7188280Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7189033Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:02:59.7189914Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:02:59.7190801Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:02:59.7191716Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:02:59.7192627Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:02:59.7193408Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:02:59.7194447Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:02:59.7195332Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7196114Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:02:59.7197005Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:02:59.7197924Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:02:59.7198854Z copying fbgemm_gpu/tbe/utils/quantize.py -> 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:02:59.7199781Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:02:59.7200585Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:59.7201480Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:02:59.7202672Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:02:59.7203757Z creating directory _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:02:59.7204569Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:02:59.7205778Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:02:59.7206514Z 2025-05-07T20:02:59.7275106Z INFO:root:running bdist_wheel 2025-05-07T20:02:59.7311187Z INFO:root:running build 2025-05-07T20:02:59.7311614Z INFO:root:running build_py 2025-05-07T20:02:59.7313579Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7315405Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7317316Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7318816Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/enums.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7320547Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7322302Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7324345Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7325782Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7327852Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7329331Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7330780Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7332632Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7334151Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7335641Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7337455Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7338942Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7340648Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7342269Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7343843Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7346485Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7348048Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7349501Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7350857Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7364638Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:02:59.7365925Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:02:59.7367335Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:02:59.7368458Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7369548Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7370879Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7372233Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7373620Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7375092Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7376681Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7378142Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7379509Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7380879Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:02:59.7382059Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:02:59.7383188Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:02:59.7384668Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:02:59.7385969Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll 2025-05-07T20:02:59.7387099Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll 2025-05-07T20:02:59.7388151Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe 2025-05-07T20:02:59.7389185Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe 2025-05-07T20:02:59.7390303Z INFO:root:creating 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:02:59.7391408Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:02:59.7392843Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:02:59.7394255Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:02:59.7395683Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:02:59.7396782Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:02:59.7397866Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:02:59.7399227Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:02:59.7400621Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:02:59.7402020Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:02:59.7403123Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 
2025-05-07T20:02:59.7404236Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:02:59.7405622Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:02:59.7406718Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:02:59.7407836Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:02:59.7409302Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:02:59.7410423Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7411625Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7413080Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7414672Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7416348Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7417973Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7419544Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7421208Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7422950Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7424667Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7426373Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7428097Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7429767Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 
2025-05-07T20:02:59.7431409Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:02:59.7432762Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7433902Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7435343Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7436815Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7438304Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7439843Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7441420Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7442948Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7444408Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7445935Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7447531Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7449040Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:02:59.7450183Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:02:59.7451325Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:02:59.7452826Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:02:59.7454215Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7455325Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7456701Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7458120Z 
INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7459558Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:59.7461608Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:02:59.7462759Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:02:59.7464315Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:02:59.7466092Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7467270Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7468965Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7470571Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7472067Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7473595Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:02:59.7475123Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:59.7476304Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:59.7477989Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:59.7479627Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:02:59.7480933Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:02:59.7482518Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:02:59.7522442Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7554561Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.7751244Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:02:59.8092557Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:01.4983062Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:01.4986834Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:01.5611131Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:01.5662353Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:01.5789105Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:01.6141952Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:03.1081283Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:03.1897849Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:07.1274743Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:07.7299775Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.1277778Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.3681447Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.4054036Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.5473671Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5475282Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5479584Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5491223Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5496511Z 
INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5506590Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5512759Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5524179Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5531165Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5541153Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5548298Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5554762Z 
INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5562015Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5572062Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5577975Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:09.5583817Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:09.5585497Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:09.5592057Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:09.5602534Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.5630735Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8646693Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8648188Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8649591Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8650929Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8652369Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8653927Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8655574Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8656948Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8658333Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8659686Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8661210Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8662724Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8664267Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8665862Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8667318Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8668847Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8670470Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8672022Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8673804Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8675769Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8677250Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8678563Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu 2025-05-07T20:03:09.8679879Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:03:09.8681370Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config 2025-05-07T20:03:09.8682762Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:03:09.8684333Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:03:09.8686297Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/examples.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:03:09.8688120Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:03:09.8689696Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:03:09.8691703Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:03:09.8693436Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:03:09.8695490Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:03:09.8697119Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs 2025-05-07T20:03:09.8698774Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:03:09.8702137Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize 2025-05-07T20:03:09.8703541Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll 2025-05-07T20:03:09.8705121Z 
INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe 2025-05-07T20:03:09.8706709Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:03:09.8708212Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:03:09.8709602Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:03:09.8711273Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton 2025-05-07T20:03:09.8712846Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:03:09.8714488Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:03:09.8715899Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:03:09.8717369Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils 2025-05-07T20:03:09.8718842Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:03:09.8720379Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu 2025-05-07T20:03:09.8722085Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:03:09.8723648Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta 2025-05-07T20:03:09.8725163Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8727129Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8728794Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8730473Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8732070Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8733651Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8735313Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8737040Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8738783Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8740485Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8742199Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8743877Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8745533Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8747118Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 
2025-05-07T20:03:09.8748590Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8750066Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8751544Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8753152Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8754728Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8757378Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8758887Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8760673Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8762267Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 
2025-05-07T20:03:09.8763788Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8765332Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:03:09.8766927Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache 2025-05-07T20:03:09.8768492Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:09.8769959Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:09.8771453Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:09.8772933Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:09.8774973Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:03:09.8777235Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats 2025-05-07T20:03:09.8778803Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8780376Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8781889Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8783437Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8784965Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8786812Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:09.8788632Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:09.8790385Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:03:09.8792038Z INFO:root:copying _skbuild/linux-x86_64-3.11/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged 2025-05-07T20:03:09.8807979Z INFO:skbuild:copied 90 files 
2025-05-07T20:03:09.8808283Z INFO:root:running build_ext 2025-05-07T20:03:09.8809932Z INFO:root:installing to _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:03:09.8810421Z INFO:root:running install 2025-05-07T20:03:09.8867970Z INFO:root:running install_lib 2025-05-07T20:03:09.8868549Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:03:09.8869247Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:03:09.8870141Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:03:09.8871431Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:03:09.8873241Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:03:09.8874509Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:03:09.8875719Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:09.8877392Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:09.8879023Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:09.8880749Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:09.8882480Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:09.8884465Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:09.8886456Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:09.8888161Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:09.8889844Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:09.8891127Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:03:09.8892335Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:03:09.8894192Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:03:09.8895457Z 
INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:03:09.8896340Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:03:09.8897611Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:03:09.8899381Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:03:09.8900694Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:03:09.8901992Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:03:09.8903786Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:03:09.8905114Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8906429Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8908181Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8910092Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8912144Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8914160Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8916163Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8918165Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8920304Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8922425Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8924461Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8926581Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8928569Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8930572Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:09.8932462Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:03:09.8933706Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:03:09.8934595Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8935894Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8937731Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8939485Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/bench_runs.py -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8941259Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8943110Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8945097Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8946931Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8948821Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8950697Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8952691Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8954588Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/bench/utils.py -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:09.8955944Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:03:09.8957249Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:03:09.8959144Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:03:09.8960541Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:09.8961441Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:09.8962805Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:09.8964757Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:09.8966573Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:03:09.8968166Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:03:09.8969852Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:03:09.8971471Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:03:09.8972678Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:03:09.8973874Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:03:09.8975603Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:03:09.8976880Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8978124Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8979786Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8981435Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8983076Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8984763Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:09.8986491Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:03:09.8987628Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:03:09.8988438Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:03:09.8989714Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:03:09.8991481Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:03:09.8993253Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:03:09.8994828Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:03:09.8996406Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:03:09.8998025Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:03:09.8999225Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:03:09.9000372Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:03:09.9001958Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:03:09.9003605Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:03:09.9005249Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:03:09.9006969Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:09.9008604Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:09.9010313Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:09.9075488Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.0404220Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.0405878Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.0460367Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.0463272Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.0475217Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.0509993Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.1671887Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.1737762Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.4748078Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.5213338Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6289875Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6475516Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6508615Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6620872Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6622638Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 
2025-05-07T20:03:10.6625128Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6627484Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6629876Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6632175Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6634525Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6636991Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6639344Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6641649Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6643931Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6646263Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6648452Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6650676Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6652891Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:10.6654485Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:10.6656171Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:10.6658397Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:10.6660318Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6661886Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6885242Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6887224Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6888842Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/enums.py -> 
_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6890378Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6892092Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6893846Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6895603Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6897189Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6898836Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6900438Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6902230Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6903925Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6905732Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6907492Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6909220Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6910978Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6912858Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6914619Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6916379Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6918151Z INFO:root:copying 
_skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6919799Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6921283Z INFO:root:copying _skbuild/linux-x86_64-3.11/setuptools/lib.linux-x86_64-cpython-311/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:10.6922165Z INFO:skbuild:copied 125 files 2025-05-07T20:03:10.6922453Z INFO:root:running install_egg_info 2025-05-07T20:03:10.6948755Z INFO:root:running egg_info 2025-05-07T20:03:10.6974480Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:03:10.6976820Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:03:10.6978739Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:03:10.6979693Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:03:10.7064364Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:03:10.7095030Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:03:10.7095991Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.11.egg-info 2025-05-07T20:03:10.7102714Z INFO:root:running install_scripts 2025-05-07T20:03:10.7103291Z INFO:skbuild:copied 0 files 2025-05-07T20:03:13.3649139Z INFO:root:creating _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:03:13.3650903Z INFO:wheel:creating 
'/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-7vekhhi3/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:03:13.3652763Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:03:13.3917410Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:03:13.3934231Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:03:13.3935138Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:03:13.5950065Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:03:13.6078195Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:03:13.6187989Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:03:14.5841205Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:03:14.6948211Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:03:15.0580069Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:03:15.1180835Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:03:15.4320933Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:03:23.9254619Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:03:24.5314068Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:03:38.7147043Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:03:40.2385588Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:03:42.1564812Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:03:42.7027460Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:03:42.9211825Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:03:47.3940268Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:03:53.1922815Z 
INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:03:53.9520924Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:03:53.9695489Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:03:53.9696594Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:03:53.9698498Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:03:53.9701795Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:03:53.9704471Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:03:53.9707455Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:03:53.9718347Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:03:53.9722067Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:03:53.9724678Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:03:53.9726273Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:03:53.9727524Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:03:53.9729257Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:03:53.9732363Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:03:53.9754589Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:03:53.9796459Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:03:53.9801812Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:03:53.9803252Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:03:53.9805204Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:03:53.9806525Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:03:53.9808187Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:03:53.9809883Z 
INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:03:53.9811616Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:03:53.9812785Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:03:53.9814546Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:03:53.9816820Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:03:53.9818590Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:03:53.9820507Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:03:53.9822121Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:03:53.9827660Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:03:53.9829515Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:03:53.9831147Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:03:53.9832915Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:03:53.9834952Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:03:53.9836859Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:03:53.9842884Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:03:53.9845259Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:03:53.9847593Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:03:53.9849944Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:03:53.9851408Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:03:53.9853208Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:03:53.9855341Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:03:53.9858831Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:03:53.9862658Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:03:53.9864620Z INFO:wheel:adding 
'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:03:53.9866657Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:03:53.9872053Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:03:53.9877386Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:03:53.9879560Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:03:53.9883129Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:03:53.9888686Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:03:53.9891316Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:03:53.9894228Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:03:53.9897600Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:03:53.9899659Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:03:53.9901357Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:03:53.9904332Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:03:53.9907297Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:03:53.9910171Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:03:53.9913207Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:03:53.9916324Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:03:53.9919215Z INFO:wheel:adding 
'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:03:53.9922318Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:03:53.9925469Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:03:53.9928270Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:03:53.9930068Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:03:53.9932562Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:03:53.9933833Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:03:53.9935655Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:03:53.9937551Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:03:53.9942352Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:03:53.9944652Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:03:53.9946962Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:03:53.9948603Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:03:53.9950101Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:03:53.9953246Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:03:53.9955952Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:03:53.9958221Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:03:53.9959844Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:03:53.9961298Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:03:53.9962881Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:03:53.9964243Z INFO:wheel:adding 
'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:03:53.9965527Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:03:53.9971204Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:03:53.9996521Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:03:53.9999944Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:03:54.0002741Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:03:54.0004205Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:03:54.0006864Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:03:54.0008514Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:03:54.0009989Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:03:54.0011443Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:03:54.0013883Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:03:54.0019215Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:03:54.0024546Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:03:54.0028213Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:03:54.0030469Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:03:54.0035038Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:03:54.0036706Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:03:54.0044517Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:03:54.0046566Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:03:54.0048809Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:03:54.0050135Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:03:54.0052281Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:03:54.0054684Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:03:54.0055823Z 
INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:03:54.0056520Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:03:54.0063110Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:03:54.0066252Z INFO:root:removing _skbuild/linux-x86_64-3.11/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:03:54.1053427Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:03:54.1053985Z │ │ Version │ 2025-05-07T20:03:54.1054552Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:03:54.1057105Z │ PyTorch │ 2.8.0.dev20250507+cu118 │ 2025-05-07T20:03:54.1057676Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:03:54.1058209Z │ CUDA (Declared by PyTorch) │ 11.8 │ 2025-05-07T20:03:54.1058866Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:03:54.1059415Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:03:54.1059958Z │ │ Copyright (c) 2005-2022 NVIDIA Corporation │ 2025-05-07T20:03:54.1060456Z │ │ Built on Wed_Sep_21_10:33:58_PDT_2022 │ 2025-05-07T20:03:54.1060932Z │ │ Cuda compilation tools, release 11.8, V11.8.89 │ 2025-05-07T20:03:54.1061488Z │ │ Build cuda_11.8.r11.8/compiler.31833905_0 │ 2025-05-07T20:03:54.1062043Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:03:54.4172627Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:03:54.4954870Z 2025-05-07T20:03:54.5120778Z ################################################################################ 2025-05-07T20:03:54.5121387Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:03:54.5121858Z [CHECK] Listing out library size: 2025-05-07T20:03:54.5122282Z + du -h --block-size=1M 
./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:03:54.5122620Z 2025-05-07T20:03:54.5136574Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:03:54.5136971Z 2025-05-07T20:03:54.5137475Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:03:54.5138401Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.5138973Z 2025-05-07T20:03:54.5212691Z GLIBC_2.2.5 2025-05-07T20:03:54.5213705Z GLIBC_2.14 2025-05-07T20:03:54.5215905Z 2025-05-07T20:03:54.5215910Z 2025-05-07T20:03:54.5216304Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:03:54.5217245Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.5217828Z 2025-05-07T20:03:54.5278758Z GLIBCXX_3.4 2025-05-07T20:03:54.5280009Z 2025-05-07T20:03:54.5280024Z 2025-05-07T20:03:54.5306388Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so > /tmp/tmp.M0Ko6Vls4L.symbols.txt 2025-05-07T20:03:54.5307683Z 2025-05-07T20:03:54.5336602Z 2025-05-07T20:03:54.5365202Z [CHECK] Total Number of symbols: 841 2025-05-07T20:03:54.5382678Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:03:54.5399810Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so > /tmp/tmp.yyOWHsrkvJ.usymbols.txt 2025-05-07T20:03:54.5401174Z 2025-05-07T20:03:54.5419623Z 2025-05-07T20:03:54.5446545Z [CHECK] Listing out undefined symbols (51 total): 2025-05-07T20:03:54.5465155Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.5466219Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.5467215Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.5468186Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.5469117Z U __errno_location@GLIBC_2.2.5 
2025-05-07T20:03:54.5470486Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.5470808Z U abort@GLIBC_2.2.5 2025-05-07T20:03:54.5471109Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:54.5471390Z U close@GLIBC_2.2.5 2025-05-07T20:03:54.5471763Z U fputs@GLIBC_2.2.5 2025-05-07T20:03:54.5472046Z U free@GLIBC_2.2.5 2025-05-07T20:03:54.5472525Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:03:54.5472852Z U fwrite@GLIBC_2.2.5 2025-05-07T20:03:54.5473138Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:54.5473451Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:03:54.5473786Z U madvise@GLIBC_2.2.5 2025-05-07T20:03:54.5474088Z U malloc@GLIBC_2.2.5 2025-05-07T20:03:54.5474436Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:54.5474739Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.5475023Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:54.5475328Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.5475632Z U mmap@GLIBC_2.2.5 2025-05-07T20:03:54.5475917Z U mprotect@GLIBC_2.2.5 2025-05-07T20:03:54.5476227Z U munmap@GLIBC_2.2.5 2025-05-07T20:03:54.5476518Z U open64@GLIBC_2.2.5 2025-05-07T20:03:54.5476840Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.5477188Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:03:54.5477539Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:54.5477874Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:54.5478206Z U read@GLIBC_2.2.5 2025-05-07T20:03:54.5478506Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:54.5478910Z U shm_open@GLIBC_2.2.5 2025-05-07T20:03:54.5479205Z U shm_unlink@GLIBC_2.2.5 2025-05-07T20:03:54.5479489Z U snprintf@GLIBC_2.2.5 2025-05-07T20:03:54.5479815Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.5480108Z U stderr@GLIBC_2.2.5 2025-05-07T20:03:54.5480394Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:54.5480661Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.5480946Z U strtol@GLIBC_2.2.5 2025-05-07T20:03:54.5481218Z U syscall@GLIBC_2.2.5 2025-05-07T20:03:54.5481507Z U sysconf@GLIBC_2.2.5 2025-05-07T20:03:54.5481792Z U uname@GLIBC_2.2.5 2025-05-07T20:03:54.5482055Z U unlink@GLIBC_2.2.5 
2025-05-07T20:03:54.5482345Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:03:54.5482679Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.5483094Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.5483501Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.5483883Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.5484200Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.5484497Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.5484796Z w __gmon_start__ 2025-05-07T20:03:54.5485104Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.5485500Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:03:54.5485902Z 2025-05-07T20:03:54.5512218Z linux-vdso.so.1 (0x00007ffd5198c000) 2025-05-07T20:03:54.5513058Z libtorch_cpu.so => not found 2025-05-07T20:03:54.5513417Z libtorch_cuda.so => not found 2025-05-07T20:03:54.5513733Z libtorch.so => not found 2025-05-07T20:03:54.5514065Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff2a33e5000) 2025-05-07T20:03:54.5514517Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007ff2a338f000) 2025-05-07T20:03:54.5514928Z librt.so.1 => /lib64/librt.so.1 (0x00007ff2a3388000) 2025-05-07T20:03:54.5515338Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff2a335a000) 2025-05-07T20:03:54.5520543Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007ff2a3355000) 2025-05-07T20:03:54.5520964Z libc.so.6 => /lib64/libc.so.6 (0x00007ff2a314d000) 2025-05-07T20:03:54.5521342Z libm.so.6 => /lib64/libm.so.6 (0x00007ff2a3072000) 2025-05-07T20:03:54.5521708Z /lib64/ld-linux-x86-64.so.2 (0x00007ff2a36c5000) 2025-05-07T20:03:54.5522050Z 2025-05-07T20:03:54.5522163Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.5522587Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so 2025-05-07T20:03:54.5522890Z 2025-05-07T20:03:54.5548758Z 2025-05-07T20:03:54.5549483Z Dynamic section at offset 0x74dd0 contains 35 entries: 2025-05-07T20:03:54.5550836Z 
Tag Type Name/Value 2025-05-07T20:03:54.5551276Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.5551977Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.5552630Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.5553156Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.5553699Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:54.5554209Z 0x0000000000000001 (NEEDED) Shared library: [librt.so.1] 2025-05-07T20:03:54.5554732Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.5555255Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:03:54.5555783Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.5556293Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:03:54.5556716Z 0x000000000000000c (INIT) 0x19000 2025-05-07T20:03:54.5557068Z 0x000000000000000d (FINI) 0x56a1c 2025-05-07T20:03:54.5557403Z 0x0000000000000019 (INIT_ARRAY) 0x74ff8 2025-05-07T20:03:54.5557762Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.5558110Z 0x000000000000001a (FINI_ARRAY) 0x75000 2025-05-07T20:03:54.5558469Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.5558816Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:54.5559165Z 0x0000000000000005 (STRTAB) 0x7120 2025-05-07T20:03:54.5559509Z 0x0000000000000006 (SYMTAB) 0x2230 2025-05-07T20:03:54.5559861Z 0x000000000000000a (STRSZ) 48790 (bytes) 2025-05-07T20:03:54.5560234Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.5560579Z 0x0000000000000003 (PLTGOT) 0x76050 2025-05-07T20:03:54.5560949Z 0x0000000000000002 (PLTRELSZ) 8472 (bytes) 2025-05-07T20:03:54.5561301Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.5561641Z 0x0000000000000017 (JMPREL) 0x16a58 2025-05-07T20:03:54.5562025Z 0x0000000000000007 (RELA) 0x13710 
2025-05-07T20:03:54.5562374Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:03:54.5562820Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.5563148Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.5563489Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.5563844Z 0x000000006ffffffe (VERNEED) 0x13650 2025-05-07T20:03:54.5564190Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:54.5564520Z 0x000000006ffffff0 (VERSYM) 0x12fb6 2025-05-07T20:03:54.5564862Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:03:54.5565189Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.5565395Z 2025-05-07T20:03:54.5565510Z ################################################################################ 2025-05-07T20:03:54.5565741Z 2025-05-07T20:03:54.5565759Z 2025-05-07T20:03:54.5565875Z ################################################################################ 2025-05-07T20:03:54.5566358Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:54.5566906Z [CHECK] Listing out library size: 2025-05-07T20:03:54.5567369Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:54.5567728Z 2025-05-07T20:03:54.5567922Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:54.5568265Z 2025-05-07T20:03:54.5568672Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:54.5569638Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.5570237Z 2025-05-07T20:03:54.5617283Z GLIBC_2.2.5 2025-05-07T20:03:54.5617955Z GLIBC_2.14 2025-05-07T20:03:54.5618318Z 2025-05-07T20:03:54.5618618Z 2025-05-07T20:03:54.5619803Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 
2025-05-07T20:03:54.5622934Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.5623502Z 2025-05-07T20:03:54.5670729Z GLIBCXX_3.4 2025-05-07T20:03:54.5671379Z GLIBCXX_3.4.9 2025-05-07T20:03:54.5672029Z GLIBCXX_3.4.21 2025-05-07T20:03:54.5672625Z 2025-05-07T20:03:54.5672646Z 2025-05-07T20:03:54.5692061Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.6lerIIPXLx.symbols.txt 2025-05-07T20:03:54.5692554Z 2025-05-07T20:03:54.5711986Z 2025-05-07T20:03:54.5741298Z [CHECK] Total Number of symbols: 116 2025-05-07T20:03:54.5754614Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:03:54.5776177Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.DiqT937eIv.usymbols.txt 2025-05-07T20:03:54.5776723Z 2025-05-07T20:03:54.5799775Z 2025-05-07T20:03:54.5831776Z [CHECK] Listing out undefined symbols (59 total): 2025-05-07T20:03:54.5852651Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.5853258Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.5853643Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.5854052Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.5854377Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.5854722Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.5855052Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.5855392Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.5855706Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:03:54.5856053Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.5856397Z U c10::BoolType::get() 2025-05-07T20:03:54.5856703Z U c10::StringType::get() 2025-05-07T20:03:54.5857051Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.5857844Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.5859115Z U c10::detail::torchInternalAssertFail(char const*, 
char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.5859954Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:54.5860252Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:54.5860556Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.5860945Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.5861263Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.5861731Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.5862121Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:54.5862914Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:54.5863707Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.5864609Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.5865483Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.5866519Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.5867518Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.5868394Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.5869394Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.5870329Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.5870933Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.5871320Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.5871704Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.5872085Z U 
std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.5872653Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.5873779Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.5874628Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.5874993Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.5875363Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.5875704Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.5876049Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.5876363Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.5876667Z U strtol@GLIBC_2.2.5 2025-05-07T20:03:54.5876995Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.5877841Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.5879100Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:03:54.5880158Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.5880822Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:03:54.5881246Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.5881680Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.5882128Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.5882793Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.5883478Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.5883949Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.5884315Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.5884647Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.5884973Z w __gmon_start__ 2025-05-07T20:03:54.5885277Z w __pthread_key_create@GLIBC_2.2.5 
2025-05-07T20:03:54.5885863Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.5886313Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:54.5886621Z 2025-05-07T20:03:54.5895652Z linux-vdso.so.1 (0x00007ffd1d7ea000) 2025-05-07T20:03:54.5896547Z libtorch.so => not found 2025-05-07T20:03:54.5897284Z libc10.so => not found 2025-05-07T20:03:54.5897993Z libtorch_cpu.so => not found 2025-05-07T20:03:54.5898787Z libtorch_cuda.so => not found 2025-05-07T20:03:54.5899772Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f6cb8660000) 2025-05-07T20:03:54.5901007Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f6cb8608000) 2025-05-07T20:03:54.5901872Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f6cb85da000) 2025-05-07T20:03:54.5902309Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f6cb85d5000) 2025-05-07T20:03:54.5902736Z libc.so.6 => /lib64/libc.so.6 (0x00007f6cb83cd000) 2025-05-07T20:03:54.5903095Z libm.so.6 => /lib64/libm.so.6 (0x00007f6cb82f2000) 2025-05-07T20:03:54.5903469Z /lib64/ld-linux-x86-64.so.2 (0x00007f6cb88d3000) 2025-05-07T20:03:54.5903709Z 2025-05-07T20:03:54.5903834Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.5904253Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:54.5904585Z 2025-05-07T20:03:54.5941910Z 2025-05-07T20:03:54.5942624Z Dynamic section at offset 0x8aa8 contains 35 entries: 2025-05-07T20:03:54.5943887Z Tag Type Name/Value 2025-05-07T20:03:54.5945133Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.5946638Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.5948141Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.5949719Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.5951251Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.5953020Z 0x0000000000000001 (NEEDED) Shared library: 
[libgomp.so.1] 2025-05-07T20:03:54.5954090Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.5954618Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:03:54.5955153Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.5955676Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:03:54.5956134Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:03:54.5956461Z 0x000000000000000d (FINI) 0x6890 2025-05-07T20:03:54.5956801Z 0x0000000000000019 (INIT_ARRAY) 0x99c0 2025-05-07T20:03:54.5957159Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:03:54.5957507Z 0x000000000000001a (FINI_ARRAY) 0x99d0 2025-05-07T20:03:54.5957861Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.5958207Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:54.5958547Z 0x0000000000000005 (STRTAB) 0xff0 2025-05-07T20:03:54.5958867Z 0x0000000000000006 (SYMTAB) 0x4f8 2025-05-07T20:03:54.5959227Z 0x000000000000000a (STRSZ) 7890 (bytes) 2025-05-07T20:03:54.5959586Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.5959943Z 0x0000000000000003 (PLTGOT) 0x9d28 2025-05-07T20:03:54.5960446Z 0x0000000000000002 (PLTRELSZ) 1632 (bytes) 2025-05-07T20:03:54.5960799Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.5961143Z 0x0000000000000017 (JMPREL) 0x3520 2025-05-07T20:03:54.5961473Z 0x0000000000000007 (RELA) 0x3070 2025-05-07T20:03:54.5961885Z 0x0000000000000008 (RELASZ) 1200 (bytes) 2025-05-07T20:03:54.5962282Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.5962633Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.5962964Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.5963334Z 0x000000006ffffffe (VERNEED) 0x2fb0 2025-05-07T20:03:54.5963688Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:54.5964018Z 0x000000006ffffff0 (VERSYM) 0x2ec2 2025-05-07T20:03:54.5964439Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:03:54.5964753Z 
0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.5964994Z 2025-05-07T20:03:54.5965109Z ################################################################################ 2025-05-07T20:03:54.5965343Z 2025-05-07T20:03:54.5965347Z 2025-05-07T20:03:54.5965476Z ################################################################################ 2025-05-07T20:03:54.5965974Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:54.5966476Z [CHECK] Listing out library size: 2025-05-07T20:03:54.5966935Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:54.5967326Z 2025-05-07T20:03:54.5967524Z 11 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:54.5967834Z 2025-05-07T20:03:54.5968239Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:54.5969223Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.5969833Z 2025-05-07T20:03:54.6045536Z GLIBC_2.2.5 2025-05-07T20:03:54.6046179Z GLIBC_2.14 2025-05-07T20:03:54.6050060Z 2025-05-07T20:03:54.6050075Z 2025-05-07T20:03:54.6051333Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:54.6054173Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.6065451Z 2025-05-07T20:03:54.6135677Z GLIBCXX_3.4 2025-05-07T20:03:54.6136861Z GLIBCXX_3.4.9 2025-05-07T20:03:54.6137545Z GLIBCXX_3.4.11 2025-05-07T20:03:54.6138153Z GLIBCXX_3.4.20 2025-05-07T20:03:54.6138753Z GLIBCXX_3.4.21 2025-05-07T20:03:54.6139119Z 2025-05-07T20:03:54.6139156Z 2025-05-07T20:03:54.6160826Z + nm -gDC 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.zo5RAEK02K.symbols.txt 2025-05-07T20:03:54.6161335Z 2025-05-07T20:03:54.6212653Z 2025-05-07T20:03:54.6241345Z [CHECK] Total Number of symbols: 819 2025-05-07T20:03:54.6258157Z [CHECK] Number of fbgemm symbols: 73 2025-05-07T20:03:54.6274616Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.VURVMAc2Bj.usymbols.txt 2025-05-07T20:03:54.6275137Z 2025-05-07T20:03:54.6295782Z 2025-05-07T20:03:54.6320993Z [CHECK] Listing out undefined symbols (152 total): 2025-05-07T20:03:54.6338838Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.6340649Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.6341699Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.6342733Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.6343290Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.6343693Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.6345380Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.6345750Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.6346137Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.6346559Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.6346891Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.6347256Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.6347589Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.6347917Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.6348259Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.6348591Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.6348951Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.6349318Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:54.6349729Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:54.6350512Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, 
std::optional) 2025-05-07T20:03:54.6351706Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.6353176Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.6354261Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:54.6355219Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.6356214Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:54.6356912Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:54.6357898Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.6359105Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.6359983Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:54.6360417Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:03:54.6360772Z U c10::BoolType::get() 2025-05-07T20:03:54.6361151Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.6361562Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:54.6361969Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.6362423Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.6362780Z U c10::IntType::get() 2025-05-07T20:03:54.6363217Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.6363735Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.6364155Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:54.6364847Z U c10::SymInt::SymInt(c10::intrusive_ptr 
>) 2025-05-07T20:03:54.6365614Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.6365969Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.6366339Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.6366716Z U c10::TensorType::get() 2025-05-07T20:03:54.6367039Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.6367967Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.6368895Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.6369719Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.6370051Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:54.6370392Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.6370718Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.6371057Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.6371500Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.6371965Z U c10::cuda::current_device() 2025-05-07T20:03:54.6372275Z U c10::cuda::device_count() 2025-05-07T20:03:54.6372597Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.6372970Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.6373335Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.6373716Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.6374114Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.6374477Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.6375193Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.6376030Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.6376871Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 
2025-05-07T20:03:54.6377959Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.6379000Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.6379827Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:54.6380338Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:54.6380709Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:54.6381159Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:54.6381567Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:54.6381953Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:54.6382366Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:54.6382728Z U c10::throwNullDataPtrError() 2025-05-07T20:03:54.6383071Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:54.6383396Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:54.6383827Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.6384261Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:54.6384719Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:54.6385112Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.6385495Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.6386038Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:54.6386430Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:54.6386851Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:54.6387282Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:54.6387640Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.6388028Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:54.6388384Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:54.6388756Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:54.6389107Z U 
cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:54.6389450Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:54.6389796Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:54.6390136Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:54.6390670Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.6391228Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:54.6391582Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:54.6391941Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:54.6392299Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.6392765Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:54.6393167Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.6393591Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.6393994Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.6394351Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:54.6394737Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:54.6395166Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.6395577Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.6395938Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.6396238Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.6396559Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.6396906Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.6397250Z U printf@GLIBC_2.2.5 2025-05-07T20:03:54.6397541Z U puts@GLIBC_2.2.5 2025-05-07T20:03:54.6398096Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.6398965Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.6399901Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.6400998Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, 
std::allocator > const&) 2025-05-07T20:03:54.6402072Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.6403006Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.6404017Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:54.6405250Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.6406013Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.6406422Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.6406821Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:54.6407226Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:54.6407690Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.6408598Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.6409379Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.6409712Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.6410054Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.6410388Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.6410770Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.6411286Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.6411722Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.6412027Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.6412334Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.6413113Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.6414421Z U 
torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.6415259Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.6416006Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.6416891Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.6417378Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.6417822Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.6418270Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.6418889Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.6419585Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.6420043Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.6420386Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.6420712Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.6421013Z w __gmon_start__ 2025-05-07T20:03:54.6421301Z w __pthread_key_create 2025-05-07T20:03:54.6421609Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:54.6421952Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:54.6422320Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.6422791Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:54.6423105Z 2025-05-07T20:03:54.6423255Z linux-vdso.so.1 (0x00007fff8b1fd000) 2025-05-07T20:03:54.6423590Z libtorch.so => not found 2025-05-07T20:03:54.6423851Z libc10.so => not found 2025-05-07T20:03:54.6424094Z libc10_cuda.so => not found 2025-05-07T20:03:54.6424372Z libtorch_cpu.so => not found 2025-05-07T20:03:54.6424659Z libtorch_cuda.so => not found 2025-05-07T20:03:54.6424957Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.6425312Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fabfad9c000) 
2025-05-07T20:03:54.6425741Z libm.so.6 => /lib64/libm.so.6 (0x00007fabfacc1000) 2025-05-07T20:03:54.6426156Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fabfac6b000) 2025-05-07T20:03:54.6426575Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fabfac3d000) 2025-05-07T20:03:54.6426978Z libc.so.6 => /lib64/libc.so.6 (0x00007fabfaa35000) 2025-05-07T20:03:54.6427368Z /lib64/ld-linux-x86-64.so.2 (0x00007fabfbc99000) 2025-05-07T20:03:54.6427622Z 2025-05-07T20:03:54.6427732Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.6428167Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:54.6428514Z 2025-05-07T20:03:54.6428518Z 2025-05-07T20:03:54.6428674Z Dynamic section at offset 0xa76868 contains 37 entries: 2025-05-07T20:03:54.6429260Z Tag Type Name/Value 2025-05-07T20:03:54.6429822Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.6430344Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.6430859Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:54.6431408Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.6432096Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.6432726Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:54.6433272Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.6433821Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:54.6434339Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:54.6434860Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.6435369Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.6435913Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:03:54.6436368Z 0x000000000000000c (INIT) 0x2e000 2025-05-07T20:03:54.6436717Z 0x000000000000000d (FINI) 
0xc47fc 2025-05-07T20:03:54.6437053Z 0x0000000000000019 (INIT_ARRAY) 0xa75ea0 2025-05-07T20:03:54.6437421Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:03:54.6437777Z 0x000000000000001a (FINI_ARRAY) 0xa75f70 2025-05-07T20:03:54.6438136Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.6438494Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:54.6438828Z 0x0000000000000005 (STRTAB) 0x6b50 2025-05-07T20:03:54.6439169Z 0x0000000000000006 (SYMTAB) 0x1e70 2025-05-07T20:03:54.6439522Z 0x000000000000000a (STRSZ) 120164 (bytes) 2025-05-07T20:03:54.6439898Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.6440244Z 0x0000000000000003 (PLTGOT) 0xa77b08 2025-05-07T20:03:54.6440618Z 0x0000000000000002 (PLTRELSZ) 10416 (bytes) 2025-05-07T20:03:54.6440981Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.6441307Z 0x0000000000000017 (JMPREL) 0x2aa30 2025-05-07T20:03:54.6441650Z 0x0000000000000007 (RELA) 0x24820 2025-05-07T20:03:54.6442002Z 0x0000000000000008 (RELASZ) 25104 (bytes) 2025-05-07T20:03:54.6442373Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.6442700Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.6443041Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.6443450Z 0x000000006ffffffe (VERNEED) 0x24720 2025-05-07T20:03:54.6443798Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:54.6444139Z 0x000000006ffffff0 (VERSYM) 0x240b4 2025-05-07T20:03:54.6444474Z 0x000000006ffffff9 (RELACOUNT) 176 2025-05-07T20:03:54.6444930Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.6445131Z 2025-05-07T20:03:54.6445269Z ################################################################################ 2025-05-07T20:03:54.6445512Z 2025-05-07T20:03:54.6445516Z 2025-05-07T20:03:54.6445627Z ################################################################################ 2025-05-07T20:03:54.6446138Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:54.6446666Z 
[CHECK] Listing out library size: 2025-05-07T20:03:54.6447132Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:54.6447510Z 2025-05-07T20:03:54.6447726Z 5 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:54.6448049Z 2025-05-07T20:03:54.6448440Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:54.6449449Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.6450050Z 2025-05-07T20:03:54.6489381Z GLIBC_2.2.5 2025-05-07T20:03:54.6490019Z GLIBC_2.3 2025-05-07T20:03:54.6490556Z GLIBC_2.14 2025-05-07T20:03:54.6490924Z 2025-05-07T20:03:54.6490939Z 2025-05-07T20:03:54.6491737Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:54.6492837Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.6493483Z 2025-05-07T20:03:54.6553315Z GLIBCXX_3.4 2025-05-07T20:03:54.6553988Z GLIBCXX_3.4.9 2025-05-07T20:03:54.6554631Z GLIBCXX_3.4.11 2025-05-07T20:03:54.6555215Z GLIBCXX_3.4.18 2025-05-07T20:03:54.6555799Z GLIBCXX_3.4.21 2025-05-07T20:03:54.6556183Z 2025-05-07T20:03:54.6556196Z 2025-05-07T20:03:54.6574754Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.PiTGDwE5C6.symbols.txt 2025-05-07T20:03:54.6576304Z 2025-05-07T20:03:54.6600814Z 2025-05-07T20:03:54.6628162Z [CHECK] Total Number of symbols: 338 2025-05-07T20:03:54.6648266Z [CHECK] Number of fbgemm symbols: 16 2025-05-07T20:03:54.6666292Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.zqEKt3OcJc.usymbols.txt 
2025-05-07T20:03:54.6666831Z 2025-05-07T20:03:54.6688208Z 2025-05-07T20:03:54.6716782Z [CHECK] Listing out undefined symbols (128 total): 2025-05-07T20:03:54.6738619Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.6741554Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.6743182Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.6744247Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.6745428Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.6746588Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.6747712Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.6748392Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.6748766Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.6749136Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.6749502Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.6749957Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.6750293Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.6750603Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.6750932Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:54.6751310Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.6751644Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:54.6752027Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:54.6752530Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:54.6753026Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:54.6753499Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:54.6753940Z U c10::BoolType::get() 2025-05-07T20:03:54.6754315Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.6754693Z U c10::FloatType::get() 2025-05-07T20:03:54.6755029Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:54.6755472Z U c10::Half* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.6755960Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.6756331Z U c10::IntType::get() 2025-05-07T20:03:54.6756749Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:54.6757180Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:54.6757596Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:54.6758016Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:54.6758744Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:54.6759457Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.6759841Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.6760205Z U c10::TensorType::get() 2025-05-07T20:03:54.6760544Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.6761546Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.6762583Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.6762965Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.6763352Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:54.6763718Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.6764100Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.6764484Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.6765084Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.6765573Z U c10::cuda::device_count() 2025-05-07T20:03:54.6765911Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.6766311Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.6766716Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.6767101Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.6767516Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.6767891Z U 
c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.6768627Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.6769530Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.6770365Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.6771355Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.6772374Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.6773161Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:54.6773537Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:54.6773871Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:54.6774259Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:54.6774668Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:54.6775020Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:54.6775438Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.6775874Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.6776282Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.6776637Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:54.6776982Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:54.6777320Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:54.6777638Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:54.6777981Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.6778329Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:54.6778687Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:54.6779012Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:54.6779344Z U cudaLaunchKernel@libcudart.so.11.0 
2025-05-07T20:03:54.6779662Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:54.6780010Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.6780365Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:54.6780694Z U float at::Tensor::item() const 2025-05-07T20:03:54.6781066Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.6781454Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.6781849Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.6782196Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.6782461Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.6782757Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.6783077Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.6783634Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.6784441Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.6785306Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.6786707Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.6787789Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.6788795Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.6789875Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.6790948Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:54.6791781Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:54.6792476Z U std::__throw_bad_alloc()@GLIBCXX_3.4 
2025-05-07T20:03:54.6792967Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:54.6793369Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.6793784Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.6794172Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:54.6794673Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.6795629Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.6796470Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.6796837Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.6797189Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.6797543Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.6797951Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.6798507Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.6799005Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:54.6799351Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.6799679Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.6800005Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.6800849Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.6802043Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.6802883Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.6803635Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.6804268Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.6804702Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.6805135Z U vtable for 
__cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.6805834Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.6806496Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.6806938Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.6807263Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.6807576Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.6807885Z w __gmon_start__ 2025-05-07T20:03:54.6808159Z w __pthread_key_create 2025-05-07T20:03:54.6808453Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:54.6808779Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:54.6809153Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.6809639Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:54.6809965Z 2025-05-07T20:03:54.6810101Z linux-vdso.so.1 (0x00007ffde41fa000) 2025-05-07T20:03:54.6810384Z libtorch.so => not found 2025-05-07T20:03:54.6810629Z libc10.so => not found 2025-05-07T20:03:54.6810859Z libc10_cuda.so => not found 2025-05-07T20:03:54.6811119Z libtorch_cpu.so => not found 2025-05-07T20:03:54.6811374Z libtorch_cuda.so => not found 2025-05-07T20:03:54.6811665Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.6811986Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f8c7e39c000) 2025-05-07T20:03:54.6812399Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f8c7ebc4000) 2025-05-07T20:03:54.6812801Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8c7eb96000) 2025-05-07T20:03:54.6813258Z libc.so.6 => /lib64/libc.so.6 (0x00007f8c7e194000) 2025-05-07T20:03:54.6813595Z /lib64/ld-linux-x86-64.so.2 (0x00007f8c7ec20000) 2025-05-07T20:03:54.6813921Z libm.so.6 => /lib64/libm.so.6 (0x00007f8c7e0b9000) 2025-05-07T20:03:54.6814146Z 2025-05-07T20:03:54.6814241Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.6814648Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:54.6814986Z 
2025-05-07T20:03:54.6828119Z 2025-05-07T20:03:54.6828286Z Dynamic section at offset 0x467450 contains 37 entries: 2025-05-07T20:03:54.6828663Z Tag Type Name/Value 2025-05-07T20:03:54.6829306Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.6830126Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.6830665Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:54.6831179Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.6831701Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.6832225Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:54.6832824Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.6833330Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:54.6833841Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.6834342Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.6834855Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:54.6835419Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:03:54.6835882Z 0x000000000000000c (INIT) 0xf000 2025-05-07T20:03:54.6836214Z 0x000000000000000d (FINI) 0x31c4c 2025-05-07T20:03:54.6836542Z 0x0000000000000019 (INIT_ARRAY) 0x467fe0 2025-05-07T20:03:54.6836893Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:03:54.6837248Z 0x000000000000001a (FINI_ARRAY) 0x468010 2025-05-07T20:03:54.6837581Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.6837929Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:54.6838251Z 0x0000000000000005 (STRTAB) 0x2cc8 2025-05-07T20:03:54.6838577Z 0x0000000000000006 (SYMTAB) 0xd00 2025-05-07T20:03:54.6838910Z 0x000000000000000a (STRSZ) 38026 (bytes) 2025-05-07T20:03:54.6839270Z 0x000000000000000b (SYMENT) 24 (bytes) 
2025-05-07T20:03:54.6839607Z 0x0000000000000003 (PLTGOT) 0x4686f0 2025-05-07T20:03:54.6839963Z 0x0000000000000002 (PLTRELSZ) 4752 (bytes) 2025-05-07T20:03:54.6840388Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.6840703Z 0x0000000000000017 (JMPREL) 0xdab0 2025-05-07T20:03:54.6841038Z 0x0000000000000007 (RELA) 0xc508 2025-05-07T20:03:54.6841369Z 0x0000000000000008 (RELASZ) 5544 (bytes) 2025-05-07T20:03:54.6841758Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.6842233Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.6842601Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.6842947Z 0x000000006ffffffe (VERNEED) 0xc3f8 2025-05-07T20:03:54.6843277Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:54.6843754Z 0x000000006ffffff0 (VERSYM) 0xc152 2025-05-07T20:03:54.6844138Z 0x000000006ffffff9 (RELACOUNT) 58 2025-05-07T20:03:54.6844640Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.6844846Z 2025-05-07T20:03:54.6844954Z ################################################################################ 2025-05-07T20:03:54.6845196Z 2025-05-07T20:03:54.6845200Z 2025-05-07T20:03:54.6845310Z ################################################################################ 2025-05-07T20:03:54.6845746Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:03:54.6846168Z [CHECK] Listing out library size: 2025-05-07T20:03:54.6846566Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:03:54.6846873Z 2025-05-07T20:03:54.6847016Z 6 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:03:54.6847266Z 2025-05-07T20:03:54.6847592Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:03:54.6848465Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.6849003Z 2025-05-07T20:03:54.7121105Z GLIBC_2.2.5 
2025-05-07T20:03:54.7121364Z GLIBC_2.3 2025-05-07T20:03:54.7121556Z GLIBC_2.14 2025-05-07T20:03:54.7125092Z 2025-05-07T20:03:54.7125097Z 2025-05-07T20:03:54.7125470Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:03:54.7126387Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.7126960Z 2025-05-07T20:03:54.7392609Z GLIBCXX_3.4 2025-05-07T20:03:54.7392869Z GLIBCXX_3.4.9 2025-05-07T20:03:54.7393089Z GLIBCXX_3.4.11 2025-05-07T20:03:54.7393303Z GLIBCXX_3.4.14 2025-05-07T20:03:54.7393558Z GLIBCXX_3.4.15 2025-05-07T20:03:54.7393848Z GLIBCXX_3.4.18 2025-05-07T20:03:54.7394178Z GLIBCXX_3.4.21 2025-05-07T20:03:54.7394619Z 2025-05-07T20:03:54.7394637Z 2025-05-07T20:03:54.7416555Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so > /tmp/tmp.4satB74kG2.symbols.txt 2025-05-07T20:03:54.7416986Z 2025-05-07T20:03:54.7645600Z 2025-05-07T20:03:54.7673075Z [CHECK] Total Number of symbols: 4957 2025-05-07T20:03:54.7690745Z [CHECK] Number of fbgemm symbols: 3554 2025-05-07T20:03:54.7708843Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so > /tmp/tmp.SBe07NBeyE.usymbols.txt 2025-05-07T20:03:54.7709313Z 2025-05-07T20:03:54.7737644Z 2025-05-07T20:03:54.7765386Z [CHECK] Listing out undefined symbols (135 total): 2025-05-07T20:03:54.7780774Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.7781932Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:54.7782912Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.7783569Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.7783900Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.7784230Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:54.7784560Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.7784892Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.7785378Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.7785943Z U 
__cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:03:54.7786290Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.7786613Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:54.7787000Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:54.7787318Z U __extendhfsf2@GCC_12.0.0 2025-05-07T20:03:54.7787690Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.7788021Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:03:54.7788352Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:54.7788654Z U __truncsfhf2@GCC_12.0.0 2025-05-07T20:03:54.7788961Z U abort@GLIBC_2.2.5 2025-05-07T20:03:54.7791060Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:54.7791805Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:54.7792993Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:54.7794221Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:54.7795407Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:03:54.7796214Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:03:54.7796813Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:03:54.7797450Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:03:54.7798114Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:03:54.7798637Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:03:54.7799297Z U 
asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:03:54.7800020Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:03:54.7800597Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:03:54.7801025Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:03:54.7801616Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:03:54.7802226Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:03:54.7802662Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:03:54.7803119Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:03:54.7803462Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:54.7803733Z U ceilf@GLIBC_2.2.5 2025-05-07T20:03:54.7804024Z U cpuinfo_get_packages 2025-05-07T20:03:54.7804321Z U cpuinfo_get_packages_count 2025-05-07T20:03:54.7804629Z U cpuinfo_initialize 2025-05-07T20:03:54.7804896Z U cpuinfo_isa 2025-05-07T20:03:54.7805155Z U floor@GLIBC_2.2.5 2025-05-07T20:03:54.7805417Z U fma@GLIBC_2.2.5 2025-05-07T20:03:54.7805686Z U fmaf@GLIBC_2.2.5 2025-05-07T20:03:54.7805958Z U free@GLIBC_2.2.5 2025-05-07T20:03:54.7806220Z U fwrite@GLIBC_2.2.5 2025-05-07T20:03:54.7806587Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:54.7806857Z U ldexp@GLIBC_2.2.5 2025-05-07T20:03:54.7807139Z U log2@GLIBC_2.2.5 2025-05-07T20:03:54.7807401Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:54.7807684Z U lrintf@GLIBC_2.2.5 2025-05-07T20:03:54.7807981Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.7808261Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.7808575Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:03:54.7808864Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:03:54.7809186Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.7809507Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:03:54.7809849Z U operator new(unsigned 
long)@GLIBCXX_3.4 2025-05-07T20:03:54.7810212Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.7810555Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:03:54.7810844Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:03:54.7811237Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:54.7811722Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:54.7812152Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:54.7812804Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:54.7813533Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:03:54.7814517Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:03:54.7815722Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.7816729Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.7817684Z U std::__cxx11::basic_string, std::allocator >::compare(char const*) const@GLIBCXX_3.4.21 2025-05-07T20:03:54.7818514Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:54.7819229Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:54.7819755Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:54.7820134Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:54.7820598Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:03:54.7821082Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 
2025-05-07T20:03:54.7821541Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:03:54.7821938Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:03:54.7822255Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:03:54.7822587Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:54.7822903Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:54.7823235Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:03:54.7823572Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:54.7823948Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:03:54.7824333Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.7824739Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:54.7825102Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:54.7825878Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.7826879Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:03:54.7827195Z U std::cout@GLIBCXX_3.4 2025-05-07T20:03:54.7827548Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:03:54.7828130Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:03:54.7828504Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:03:54.7828926Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.7829290Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.7829975Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:54.7830742Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:03:54.7831274Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:03:54.7831797Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.7832361Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.7832925Z U std::ostream::flush()@GLIBCXX_3.4 
2025-05-07T20:03:54.7833300Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:54.7833686Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:03:54.7834160Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:54.7834715Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:54.7835163Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:54.7835551Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.7835875Z U stderr@GLIBC_2.2.5 2025-05-07T20:03:54.7836190Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:54.7836487Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.7836796Z U strstr@GLIBC_2.2.5 2025-05-07T20:03:54.7837088Z U tolower@GLIBC_2.2.5 2025-05-07T20:03:54.7837406Z U toupper@GLIBC_2.2.5 2025-05-07T20:03:54.7837802Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:03:54.7838238Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:03:54.7838644Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:03:54.7839039Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:54.7839549Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.7839963Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.7840335Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:03:54.7840691Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:03:54.7841023Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.7841337Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.7841629Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.7841923Z w __gmon_start__ 2025-05-07T20:03:54.7842180Z w __pthread_key_create 2025-05-07T20:03:54.7842484Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:54.7842805Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:54.7843096Z w pthread_once 2025-05-07T20:03:54.7843427Z w pthread_rwlock_rdlock 2025-05-07T20:03:54.7843709Z w pthread_rwlock_unlock 2025-05-07T20:03:54.7844002Z w 
pthread_rwlock_wrlock 2025-05-07T20:03:54.7844285Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:03:54.7844650Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.7845029Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:03:54.7845311Z 2025-05-07T20:03:54.7845434Z linux-vdso.so.1 (0x00007ffd69dfe000) 2025-05-07T20:03:54.7845726Z libc10.so => not found 2025-05-07T20:03:54.7846214Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f2cf178b000) 2025-05-07T20:03:54.7846767Z libtorch.so => not found 2025-05-07T20:03:54.7847010Z libtorch_cpu.so => not found 2025-05-07T20:03:54.7847310Z libtorch_cuda.so => not found 2025-05-07T20:03:54.7847628Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2cf0f9c000) 2025-05-07T20:03:54.7848015Z libm.so.6 => /lib64/libm.so.6 (0x00007f2cf0ec1000) 2025-05-07T20:03:54.7848389Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2cf0e93000) 2025-05-07T20:03:54.7848749Z libc.so.6 => /lib64/libc.so.6 (0x00007f2cf0c8b000) 2025-05-07T20:03:54.7849102Z /lib64/ld-linux-x86-64.so.2 (0x00007f2cf1807000) 2025-05-07T20:03:54.7849417Z libtorch_cpu.so => not found 2025-05-07T20:03:54.7849685Z libtorch_cuda.so => not found 2025-05-07T20:03:54.7849939Z libtorch.so => not found 2025-05-07T20:03:54.7850243Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2cf0c35000) 2025-05-07T20:03:54.7850613Z librt.so.1 => /lib64/librt.so.1 (0x00007f2cf1782000) 2025-05-07T20:03:54.7851013Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f2cf177d000) 2025-05-07T20:03:54.7851283Z 2025-05-07T20:03:54.7851396Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.7851744Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so 2025-05-07T20:03:54.7852021Z 2025-05-07T20:03:54.7869146Z 2025-05-07T20:03:54.7870372Z Dynamic section at offset 0x54e508 contains 37 entries: 2025-05-07T20:03:54.7871573Z Tag Type Name/Value 2025-05-07T20:03:54.7873063Z 0x0000000000000001 (NEEDED) Shared 
library: [libc10.so] 2025-05-07T20:03:54.7873884Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:03:54.7874407Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.7874927Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.7875464Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.7875996Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.7876522Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:54.7877041Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.7877539Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.7878076Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:54.7878601Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:03:54.7879211Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:54.7879611Z 0x000000000000000c (INIT) 0xfd000 2025-05-07T20:03:54.7879943Z 0x000000000000000d (FINI) 0x4c1d18 2025-05-07T20:03:54.7880281Z 0x0000000000000019 (INIT_ARRAY) 0x54b000 2025-05-07T20:03:54.7880628Z 0x000000000000001b (INIT_ARRAYSZ) 1224 (bytes) 2025-05-07T20:03:54.7880985Z 0x000000000000001a (FINI_ARRAY) 0x54b4c8 2025-05-07T20:03:54.7881317Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.7881727Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:54.7882235Z 0x0000000000000005 (STRTAB) 0x24e38 2025-05-07T20:03:54.7882575Z 0x0000000000000006 (SYMTAB) 0x7d68 2025-05-07T20:03:54.7883097Z 0x000000000000000a (STRSZ) 754916 (bytes) 2025-05-07T20:03:54.7883481Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.7883974Z 0x0000000000000003 (PLTGOT) 0x54e798 2025-05-07T20:03:54.7884331Z 0x0000000000000002 (PLTRELSZ) 26136 (bytes) 2025-05-07T20:03:54.7884774Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.7885135Z 0x0000000000000017 (JMPREL) 
0xf6768 2025-05-07T20:03:54.7885478Z 0x0000000000000007 (RELA) 0xdfb48 2025-05-07T20:03:54.7886054Z 0x0000000000000008 (RELASZ) 93216 (bytes) 2025-05-07T20:03:54.7886431Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.7886783Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.7887186Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.7887554Z 0x000000006ffffffe (VERNEED) 0xdf9d8 2025-05-07T20:03:54.7887887Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:54.7888229Z 0x000000006ffffff0 (VERSYM) 0xdd31c 2025-05-07T20:03:54.7888558Z 0x000000006ffffff9 (RELACOUNT) 155 2025-05-07T20:03:54.7888882Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.7889094Z 2025-05-07T20:03:54.7889228Z ################################################################################ 2025-05-07T20:03:54.7889458Z 2025-05-07T20:03:54.7889462Z 2025-05-07T20:03:54.7889577Z ################################################################################ 2025-05-07T20:03:54.7890087Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:54.7890574Z [CHECK] Listing out library size: 2025-05-07T20:03:54.7891047Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:54.7891424Z 2025-05-07T20:03:54.7891636Z 2 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:54.7891944Z 2025-05-07T20:03:54.7892335Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:54.7893340Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.7893945Z 2025-05-07T20:03:54.7945807Z GLIBC_2.2.5 2025-05-07T20:03:54.7946454Z GLIBC_2.14 2025-05-07T20:03:54.7946822Z 2025-05-07T20:03:54.7946858Z 2025-05-07T20:03:54.7948095Z [CHECK] Listing out the GLIBCXX versions referenced 
by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:54.7949373Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.7950007Z 2025-05-07T20:03:54.8011752Z GLIBCXX_3.4 2025-05-07T20:03:54.8012427Z GLIBCXX_3.4.9 2025-05-07T20:03:54.8013029Z GLIBCXX_3.4.14 2025-05-07T20:03:54.8013638Z GLIBCXX_3.4.20 2025-05-07T20:03:54.8014230Z GLIBCXX_3.4.21 2025-05-07T20:03:54.8014590Z 2025-05-07T20:03:54.8014604Z 2025-05-07T20:03:54.8033279Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.6Ty2edbC9W.symbols.txt 2025-05-07T20:03:54.8033820Z 2025-05-07T20:03:54.8063813Z 2025-05-07T20:03:54.8088462Z [CHECK] Total Number of symbols: 540 2025-05-07T20:03:54.8104639Z [CHECK] Number of fbgemm symbols: 48 2025-05-07T20:03:54.8119481Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.M9f6vbKG48.usymbols.txt 2025-05-07T20:03:54.8120919Z 2025-05-07T20:03:54.8139276Z 2025-05-07T20:03:54.8163279Z [CHECK] Listing out undefined symbols (183 total): 2025-05-07T20:03:54.8179443Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.8180123Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.8180521Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.8181098Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.8181504Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.8181887Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.8182341Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.8182712Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.8183120Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.8183495Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:54.8183824Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.8184148Z U 
__cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.8184466Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.8184841Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:54.8185172Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.8185510Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.8186076Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.8186396Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.8186698Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:54.8187008Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.8187534Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:03:54.8188120Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:54.8188599Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:54.8189559Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8190479Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:03:54.8190933Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:54.8191440Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:54.8192108Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:54.8193342Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:54.8194243Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:54.8195070Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8195931Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:54.8196278Z U at::get_num_threads() 2025-05-07T20:03:54.8196594Z U at::get_thread_num() 2025-05-07T20:03:54.8196917Z U at::internal::set_thread_num(int) 2025-05-07T20:03:54.8197279Z U 
at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:03:54.8197640Z U c10::BoolType::get() 2025-05-07T20:03:54.8197999Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.8198671Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:54.8199284Z U c10::Error::what() const 2025-05-07T20:03:54.8199643Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8200098Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8200616Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.8200985Z U c10::IntType::get() 2025-05-07T20:03:54.8201355Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:54.8201822Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:54.8202324Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.8202798Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:54.8203176Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:54.8203552Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:54.8203967Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:54.8204806Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:54.8205430Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.8205802Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:54.8206143Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.8206491Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:54.8206829Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:54.8207128Z U c10::SymIntType::get() 2025-05-07T20:03:54.8207476Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:54.8207816Z U c10::TensorType::get() 2025-05-07T20:03:54.8208131Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.8209188Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, 
std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.8210120Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.8210514Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:03:54.8211028Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:03:54.8211713Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:03:54.8212259Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.8212583Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:54.8212908Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.8213225Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.8213585Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.8214036Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.8214479Z U c10::cuda::device_count() 2025-05-07T20:03:54.8214811Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.8215198Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.8215564Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.8215950Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.8216336Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.8216712Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.8217417Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.8218271Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.8219332Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8220281Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.8223070Z U 
c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8223927Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:54.8224272Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:54.8224635Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:54.8225054Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:54.8225439Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:54.8225850Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:54.8226269Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:54.8226686Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:54.8227065Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:54.8227441Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:54.8227804Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:54.8228222Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.8228689Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8229074Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.8229463Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:54.8229823Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:54.8230191Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:54.8230553Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:54.8230906Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8231282Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:54.8231636Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:54.8231990Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:54.8232335Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:54.8232789Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8233157Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:54.8234205Z U fbgemm::EmbeddingSpMDMKernelSignature::Type 
fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8235912Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8237630Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8239351Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8241145Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8243027Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8244902Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8246744Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8248486Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8250209Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8251941Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 
2025-05-07T20:03:54.8253683Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8254791Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:03:54.8255216Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:03:54.8255859Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:54.8256377Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8256800Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8257376Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8257775Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8258212Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:54.8258654Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8259066Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8259429Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.8259731Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.8260032Z U omp_get_max_threads@OMP_1.0 2025-05-07T20:03:54.8260365Z U omp_get_thread_num@OMP_1.0 2025-05-07T20:03:54.8260693Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.8261054Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.8261649Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.8262528Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.8263470Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8264554Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8265665Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8266613Z U 
std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8267682Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:54.8268797Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8269619Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:54.8270016Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.8270431Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.8270870Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:54.8271404Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.8272428Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8273287Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:54.8273650Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.8274019Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.8274373Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.8274729Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.8275140Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8275699Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8276202Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:54.8276752Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:03:54.8277732Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:03:54.8278912Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:03:54.8279651Z U std::terminate()@GLIBCXX_3.4 
2025-05-07T20:03:54.8279982Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.8280300Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.8281161Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.8282374Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.8283225Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.8283994Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.8284600Z U typeinfo for c10::Error 2025-05-07T20:03:54.8284941Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:54.8285428Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8286034Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.8286483Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.8286997Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.8287379Z U vtable for c10::Error 2025-05-07T20:03:54.8287975Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.8288662Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.8289139Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.8289521Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.8289840Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.8290158Z w __gmon_start__ 2025-05-07T20:03:54.8290439Z w __pthread_key_create 2025-05-07T20:03:54.8290805Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.8291271Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:54.8291614Z 2025-05-07T20:03:54.8291758Z linux-vdso.so.1 (0x00007fffe25e7000) 2025-05-07T20:03:54.8292070Z libc10.so => not found 2025-05-07T20:03:54.8292320Z libc10_cuda.so 
=> not found 2025-05-07T20:03:54.8292885Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f47e1c00000) 2025-05-07T20:03:54.8306524Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f47e2346000) 2025-05-07T20:03:54.8307193Z libtorch.so => not found 2025-05-07T20:03:54.8307465Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8307735Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8308007Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.8308340Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f47e199c000) 2025-05-07T20:03:54.8308768Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f47e22ee000) 2025-05-07T20:03:54.8309180Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f47e22c0000) 2025-05-07T20:03:54.8309556Z libc.so.6 => /lib64/libc.so.6 (0x00007f47e1794000) 2025-05-07T20:03:54.8309875Z libc10.so => not found 2025-05-07T20:03:54.8310375Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f47e2246000) 2025-05-07T20:03:54.8310937Z libtorch.so => not found 2025-05-07T20:03:54.8311182Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8311452Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8311750Z libm.so.6 => /lib64/libm.so.6 (0x00007f47e2169000) 2025-05-07T20:03:54.8312254Z /lib64/ld-linux-x86-64.so.2 (0x00007f47e254b000) 2025-05-07T20:03:54.8312682Z libtorch.so => not found 2025-05-07T20:03:54.8313097Z libc10.so => not found 2025-05-07T20:03:54.8313350Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8313629Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8313977Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f47e2162000) 2025-05-07T20:03:54.8314345Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8314606Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8314866Z libtorch.so => not found 2025-05-07T20:03:54.8315159Z librt.so.1 => /lib64/librt.so.1 (0x00007f47e178f000) 
2025-05-07T20:03:54.8315400Z 2025-05-07T20:03:54.8315524Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.8315947Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:54.8316299Z 2025-05-07T20:03:54.8316303Z 2025-05-07T20:03:54.8316459Z Dynamic section at offset 0x189ef8 contains 39 entries: 2025-05-07T20:03:54.8316827Z Tag Type Name/Value 2025-05-07T20:03:54.8317249Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.8317752Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:54.8318370Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:54.8318890Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:54.8319419Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.8319977Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.8320527Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.8321065Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:54.8321588Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.8322099Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:54.8322642Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.8323138Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.8323672Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:54.8324192Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:54.8324600Z 0x000000000000000c (INIT) 0x16000 2025-05-07T20:03:54.8324937Z 0x000000000000000d (FINI) 0x60bac 2025-05-07T20:03:54.8325374Z 0x0000000000000019 (INIT_ARRAY) 0x189258 2025-05-07T20:03:54.8325721Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:03:54.8326058Z 0x000000000000001a (FINI_ARRAY) 0x1892a0 
2025-05-07T20:03:54.8326398Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.8326724Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:54.8327047Z 0x0000000000000005 (STRTAB) 0x4598 2025-05-07T20:03:54.8327360Z 0x0000000000000006 (SYMTAB) 0x12e0 2025-05-07T20:03:54.8327704Z 0x000000000000000a (STRSZ) 47880 (bytes) 2025-05-07T20:03:54.8328062Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.8328396Z 0x0000000000000003 (PLTGOT) 0x18a1a8 2025-05-07T20:03:54.8328747Z 0x0000000000000002 (PLTRELSZ) 9240 (bytes) 2025-05-07T20:03:54.8329082Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.8329406Z 0x0000000000000017 (JMPREL) 0x131f0 2025-05-07T20:03:54.8329726Z 0x0000000000000007 (RELA) 0x105e0 2025-05-07T20:03:54.8330070Z 0x0000000000000008 (RELASZ) 11280 (bytes) 2025-05-07T20:03:54.8330615Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.8330941Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.8331269Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.8331614Z 0x000000006ffffffe (VERNEED) 0x104e0 2025-05-07T20:03:54.8331947Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:54.8332269Z 0x000000006ffffff0 (VERSYM) 0x100a0 2025-05-07T20:03:54.8332609Z 0x000000006ffffff9 (RELACOUNT) 245 2025-05-07T20:03:54.8332910Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.8333122Z 2025-05-07T20:03:54.8333233Z ################################################################################ 2025-05-07T20:03:54.8333459Z 2025-05-07T20:03:54.8333463Z 2025-05-07T20:03:54.8333585Z ################################################################################ 2025-05-07T20:03:54.8334068Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:54.8334561Z [CHECK] Listing out library size: 2025-05-07T20:03:54.8335003Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:54.8335380Z 2025-05-07T20:03:54.8335572Z 8 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:54.8335870Z 2025-05-07T20:03:54.8336261Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:54.8337266Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.8337864Z 2025-05-07T20:03:54.8337973Z GLIBC_2.2.5 2025-05-07T20:03:54.8338195Z GLIBC_2.14 2025-05-07T20:03:54.8338323Z 2025-05-07T20:03:54.8338327Z 2025-05-07T20:03:54.8338747Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:54.8339747Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.8340352Z 2025-05-07T20:03:54.8391416Z GLIBCXX_3.4 2025-05-07T20:03:54.8391676Z GLIBCXX_3.4.9 2025-05-07T20:03:54.8392034Z GLIBCXX_3.4.20 2025-05-07T20:03:54.8392295Z GLIBCXX_3.4.21 2025-05-07T20:03:54.8392674Z 2025-05-07T20:03:54.8392683Z 2025-05-07T20:03:54.8412319Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.onDCouQwbN.symbols.txt 2025-05-07T20:03:54.8412825Z 2025-05-07T20:03:54.8438239Z 2025-05-07T20:03:54.8465059Z [CHECK] Total Number of symbols: 501 2025-05-07T20:03:54.8476472Z [CHECK] Number of fbgemm symbols: 13 2025-05-07T20:03:54.8494068Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.Qe2rmKgVPS.usymbols.txt 2025-05-07T20:03:54.8495535Z 2025-05-07T20:03:54.8513362Z 2025-05-07T20:03:54.8537806Z [CHECK] Listing out undefined symbols (154 total): 2025-05-07T20:03:54.8553090Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.8553811Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.8554201Z U 
__cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.8554617Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.8555036Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.8555435Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.8555826Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.8556182Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.8556558Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.8556932Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:54.8557256Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.8557569Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.8557882Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.8558200Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.8558515Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.8558948Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.8559352Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.8559634Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:54.8559924Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.8560359Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.8560852Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:54.8561668Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8562928Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8563814Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:54.8564420Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.8564866Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:54.8565339Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 
2025-05-07T20:03:54.8565808Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:54.8566515Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8567617Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8568403Z U c10::BoolType::get() 2025-05-07T20:03:54.8568735Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.8569132Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.8569468Z U c10::IntType::get() 2025-05-07T20:03:54.8569799Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:54.8570182Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:54.8570592Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.8571060Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.8571454Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:54.8572069Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:54.8572691Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.8573028Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.8573342Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:54.8573637Z U c10::SymIntType::get() 2025-05-07T20:03:54.8573962Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:54.8574353Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.8574864Z U c10::TensorType::get() 2025-05-07T20:03:54.8575186Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.8576141Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.8577096Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.8577453Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.8577789Z U c10::cuda::ExchangeDevice(signed 
char) 2025-05-07T20:03:54.8578129Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.8578460Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.8578799Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.8579267Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.8579732Z U c10::cuda::current_device() 2025-05-07T20:03:54.8580036Z U c10::cuda::device_count() 2025-05-07T20:03:54.8580366Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.8580745Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.8581129Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.8581512Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.8581912Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.8582324Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.8583064Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.8583981Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.8584881Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8586163Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.8587289Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8588125Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:54.8588470Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:54.8588840Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:54.8589281Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:54.8589699Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:54.8590066Z U 
c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:54.8590467Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:54.8590824Z U c10::throwNullDataPtrError() 2025-05-07T20:03:54.8591157Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:54.8591479Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:54.8591899Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.8592019Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:54.8592166Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:54.8592298Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8592502Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.8592641Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:54.8592768Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:54.8592885Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:54.8593004Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:54.8593138Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8593265Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:54.8593383Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:54.8593517Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:54.8593653Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:54.8593777Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:54.8593906Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:54.8594023Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:54.8594140Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:54.8594436Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.8594574Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:54.8594683Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:54.8594799Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:54.8594937Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8595060Z U 
cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:54.8595186Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8595392Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8595485Z U log2@GLIBC_2.2.5 2025-05-07T20:03:54.8595663Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:54.8595809Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8595998Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8596125Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.8596220Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.8596347Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.8596469Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.8596563Z U printf@GLIBC_2.2.5 2025-05-07T20:03:54.8596693Z U puts@GLIBC_2.2.5 2025-05-07T20:03:54.8597050Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.8597458Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.8597881Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8598440Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8598837Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8599271Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8599739Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:54.8600283Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8600427Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.8600569Z U 
std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.8600753Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:54.8600998Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.8601598Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8601742Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:54.8601863Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.8601985Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.8602115Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.8602230Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.8602415Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8602668Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8602797Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:54.8602908Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.8603017Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.8603141Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.8603798Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.8604278Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.8604595Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.8604972Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.8605114Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:54.8605296Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.8605464Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.8605642Z U vtable for 
__cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.8606084Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.8606309Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.8606434Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.8606540Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.8606641Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.8606745Z w __gmon_start__ 2025-05-07T20:03:54.8606840Z w __pthread_key_create 2025-05-07T20:03:54.8606983Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.8607182Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:54.8607199Z 2025-05-07T20:03:54.8607316Z linux-vdso.so.1 (0x00007ffc123f2000) 2025-05-07T20:03:54.8607414Z libtorch.so => not found 2025-05-07T20:03:54.8607500Z libc10.so => not found 2025-05-07T20:03:54.8607602Z libc10_cuda.so => not found 2025-05-07T20:03:54.8607696Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8607788Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8607896Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.8608061Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f771599c000) 2025-05-07T20:03:54.8608186Z libm.so.6 => /lib64/libm.so.6 (0x00007f771658c000) 2025-05-07T20:03:54.8608343Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f7716536000) 2025-05-07T20:03:54.8608486Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7716508000) 2025-05-07T20:03:54.8608608Z libc.so.6 => /lib64/libc.so.6 (0x00007f7715794000) 2025-05-07T20:03:54.8608731Z /lib64/ld-linux-x86-64.so.2 (0x00007f771666d000) 2025-05-07T20:03:54.8608736Z 2025-05-07T20:03:54.8608855Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.8609083Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:54.8609090Z 2025-05-07T20:03:54.8625249Z 2025-05-07T20:03:54.8626128Z Dynamic section at offset 0x7de050 contains 37 entries: 2025-05-07T20:03:54.8626490Z Tag Type 
Name/Value 2025-05-07T20:03:54.8627146Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.8627752Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.8628351Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:54.8628951Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.8629564Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.8630176Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:54.8630772Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.8631324Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:54.8632118Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:54.8632725Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.8632917Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.8633199Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:54.8633345Z 0x000000000000000c (INIT) 0x14000 2025-05-07T20:03:54.8633457Z 0x000000000000000d (FINI) 0x5fb3c 2025-05-07T20:03:54.8633588Z 0x0000000000000019 (INIT_ARRAY) 0x7dd548 2025-05-07T20:03:54.8633711Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:03:54.8633824Z 0x000000000000001a (FINI_ARRAY) 0x7dd5a8 2025-05-07T20:03:54.8633981Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.8634112Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:54.8634228Z 0x0000000000000005 (STRTAB) 0x4240 2025-05-07T20:03:54.8634335Z 0x0000000000000006 (SYMTAB) 0x1330 2025-05-07T20:03:54.8634477Z 0x000000000000000a (STRSZ) 43494 (bytes) 2025-05-07T20:03:54.8634594Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.8634713Z 0x0000000000000003 (PLTGOT) 0x7de2f0 2025-05-07T20:03:54.8634851Z 0x0000000000000002 (PLTRELSZ) 6432 (bytes) 2025-05-07T20:03:54.8634956Z 
0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.8635067Z 0x0000000000000017 (JMPREL) 0x11f88 2025-05-07T20:03:54.8635173Z 0x0000000000000007 (RELA) 0xf108 2025-05-07T20:03:54.8635312Z 0x0000000000000008 (RELASZ) 11904 (bytes) 2025-05-07T20:03:54.8635433Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.8635533Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.8635666Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.8635780Z 0x000000006ffffffe (VERNEED) 0xf018 2025-05-07T20:03:54.8635888Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:54.8635999Z 0x000000006ffffff0 (VERSYM) 0xec26 2025-05-07T20:03:54.8636112Z 0x000000006ffffff9 (RELACOUNT) 116 2025-05-07T20:03:54.8636210Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.8636215Z 2025-05-07T20:03:54.8636333Z ################################################################################ 2025-05-07T20:03:54.8636338Z 2025-05-07T20:03:54.8636342Z 2025-05-07T20:03:54.8636458Z ################################################################################ 2025-05-07T20:03:54.8636769Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:54.8636871Z [CHECK] Listing out library size: 2025-05-07T20:03:54.8637180Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:54.8637185Z 2025-05-07T20:03:54.8637866Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:54.8639105Z 2025-05-07T20:03:54.8640005Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:54.8640549Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.8640556Z 2025-05-07T20:03:54.8689232Z GLIBC_2.2.5 2025-05-07T20:03:54.8689500Z GLIBC_2.14 
2025-05-07T20:03:54.8690808Z 2025-05-07T20:03:54.8690829Z 2025-05-07T20:03:54.8692499Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:54.8693100Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.8693260Z 2025-05-07T20:03:54.8747732Z GLIBCXX_3.4 2025-05-07T20:03:54.8748543Z GLIBCXX_3.4.9 2025-05-07T20:03:54.8748824Z GLIBCXX_3.4.21 2025-05-07T20:03:54.8748842Z 2025-05-07T20:03:54.8748856Z 2025-05-07T20:03:54.8769520Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.LKY76ChkUV.symbols.txt 2025-05-07T20:03:54.8769867Z 2025-05-07T20:03:54.8787512Z 2025-05-07T20:03:54.8813546Z [CHECK] Total Number of symbols: 274 2025-05-07T20:03:54.8827826Z [CHECK] Number of fbgemm symbols: 44 2025-05-07T20:03:54.8841313Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.I74mykjbFf.usymbols.txt 2025-05-07T20:03:54.8841331Z 2025-05-07T20:03:54.8860850Z 2025-05-07T20:03:54.8883944Z [CHECK] Listing out undefined symbols (130 total): 2025-05-07T20:03:54.8905213Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.8905495Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.8905880Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.8906081Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.8906237Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.8906518Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.8906682Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.8906813Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.8906972Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.8907075Z U __cxa_atexit@GLIBC_2.2.5 
2025-05-07T20:03:54.8907192Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.8907301Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.8907410Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.8907550Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.8907663Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.8907880Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:03:54.8908486Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8909157Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8909344Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:54.8909459Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:54.8909949Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8910060Z U at::get_thread_num() 2025-05-07T20:03:54.8910177Z U at::internal::set_thread_num(int) 2025-05-07T20:03:54.8910766Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8911057Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:54.8911240Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8911410Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.8911561Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8911703Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.8911976Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:54.8912112Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:54.8912270Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.8912566Z U c10::SymBool::guard_bool(char const*, long) const 
2025-05-07T20:03:54.8912695Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.8912892Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:54.8913050Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.8913164Z U c10::TensorType::get() 2025-05-07T20:03:54.8913282Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.8914055Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.8914200Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.8914319Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.8914441Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:54.8914568Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.8914685Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.8914816Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.8915078Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.8915213Z U c10::cuda::device_count() 2025-05-07T20:03:54.8915354Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.8915494Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.8915660Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.8915811Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.8915973Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.8916116Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.8916664Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.8916927Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.8917460Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8917812Z U c10::detail::torchInternalAssertFail(char 
const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.8917935Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:54.8918064Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:54.8918220Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:54.8918395Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:54.8918640Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:54.8918783Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:54.8918917Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:54.8919041Z U c10::throwNullDataPtrError() 2025-05-07T20:03:54.8919157Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:54.8919267Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:54.8919460Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.8919576Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:54.8920859Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:54.8920990Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8921140Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.8921294Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:54.8921420Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:54.8921595Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:54.8921714Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:54.8921841Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8921979Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:54.8922119Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:54.8922239Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:54.8922360Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:54.8922485Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:54.8922605Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:54.8922884Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 
2025-05-07T20:03:54.8923016Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:54.8923123Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:54.8923241Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:54.8923374Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8923492Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:54.8923633Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8923761Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8923947Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:54.8924074Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8924174Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.8924278Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.8924388Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.8924508Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.8924617Z U printf@GLIBC_2.2.5 2025-05-07T20:03:54.8924709Z U puts@GLIBC_2.2.5 2025-05-07T20:03:54.8925046Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.8925433Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.8925949Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8926345Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8926852Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8926996Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.8927136Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.8927385Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.8927943Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, 
long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8928098Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.8928215Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.8928328Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.8928519Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.8928698Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8928819Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.8928917Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.8929053Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.8929635Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.8930095Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.8930344Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.8930689Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.8930857Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.8931018Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.8931175Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.8931498Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.8931721Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.8931830Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.8931939Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.8932035Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.8932119Z w __gmon_start__ 2025-05-07T20:03:54.8932209Z w __pthread_key_create 2025-05-07T20:03:54.8932354Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.8932578Z + ldd 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:54.8932585Z 2025-05-07T20:03:54.8950183Z linux-vdso.so.1 (0x00007ffd453dc000) 2025-05-07T20:03:54.8950997Z libc10.so => not found 2025-05-07T20:03:54.8951341Z libc10_cuda.so => not found 2025-05-07T20:03:54.8951611Z libtorch.so => not found 2025-05-07T20:03:54.8951904Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8952201Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8952716Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.8953237Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fddd2839000) 2025-05-07T20:03:54.8953721Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fddd27e3000) 2025-05-07T20:03:54.8954166Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fddd27b5000) 2025-05-07T20:03:54.8954531Z libc.so.6 => /lib64/libc.so.6 (0x00007fddd25ad000) 2025-05-07T20:03:54.8954905Z libm.so.6 => /lib64/libm.so.6 (0x00007fddd24d2000) 2025-05-07T20:03:54.8955280Z /lib64/ld-linux-x86-64.so.2 (0x00007fddd2b9b000) 2025-05-07T20:03:54.8955299Z 2025-05-07T20:03:54.8955623Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.8956442Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:54.8956459Z 2025-05-07T20:03:54.8983229Z 2025-05-07T20:03:54.8983778Z Dynamic section at offset 0xc06b8 contains 37 entries: 2025-05-07T20:03:54.8983920Z Tag Type Name/Value 2025-05-07T20:03:54.8984148Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.8984380Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:54.8984681Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.8984901Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.8985107Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.8985355Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:54.8985603Z 0x0000000000000001 
(NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.8985986Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:54.8986186Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.8986390Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.8986747Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:54.8986952Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:54.8987068Z 0x000000000000000c (INIT) 0xa000 2025-05-07T20:03:54.8987191Z 0x000000000000000d (FINI) 0x1813c 2025-05-07T20:03:54.8987304Z 0x0000000000000019 (INIT_ARRAY) 0xc13b0 2025-05-07T20:03:54.8987432Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:03:54.8987558Z 0x000000000000001a (FINI_ARRAY) 0xc13d0 2025-05-07T20:03:54.8987681Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.8987799Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:54.8987919Z 0x0000000000000005 (STRTAB) 0x22f0 2025-05-07T20:03:54.8988024Z 0x0000000000000006 (SYMTAB) 0x928 2025-05-07T20:03:54.8988156Z 0x000000000000000a (STRSZ) 20379 (bytes) 2025-05-07T20:03:54.8988279Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.8988395Z 0x0000000000000003 (PLTGOT) 0xc1948 2025-05-07T20:03:54.8988530Z 0x0000000000000002 (PLTRELSZ) 3936 (bytes) 2025-05-07T20:03:54.8988639Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.8988752Z 0x0000000000000017 (JMPREL) 0x8298 2025-05-07T20:03:54.8988858Z 0x0000000000000007 (RELA) 0x7578 2025-05-07T20:03:54.8989009Z 0x0000000000000008 (RELASZ) 3360 (bytes) 2025-05-07T20:03:54.8989135Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.8989233Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.8989359Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.8989469Z 0x000000006ffffffe (VERNEED) 0x74b8 2025-05-07T20:03:54.8989586Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:54.8989698Z 
0x000000006ffffff0 (VERSYM) 0x728c 2025-05-07T20:03:54.8989804Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:03:54.8989912Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.8989917Z 2025-05-07T20:03:54.8990031Z ################################################################################ 2025-05-07T20:03:54.8990036Z 2025-05-07T20:03:54.8990040Z 2025-05-07T20:03:54.8990148Z ################################################################################ 2025-05-07T20:03:54.8990493Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:54.8990597Z [CHECK] Listing out library size: 2025-05-07T20:03:54.8990921Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:54.8990925Z 2025-05-07T20:03:54.8996662Z 11 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:54.8997772Z 2025-05-07T20:03:54.8998779Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:54.8999352Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.8999428Z 2025-05-07T20:03:54.9457067Z GLIBC_2.2.5 2025-05-07T20:03:54.9457316Z GLIBC_2.3 2025-05-07T20:03:54.9457580Z GLIBC_2.14 2025-05-07T20:03:54.9457599Z 2025-05-07T20:03:54.9457932Z 2025-05-07T20:03:54.9459472Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:54.9461214Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.9461229Z 2025-05-07T20:03:54.9914325Z GLIBCXX_3.4 
2025-05-07T20:03:54.9914604Z GLIBCXX_3.4.9 2025-05-07T20:03:54.9914849Z GLIBCXX_3.4.11 2025-05-07T20:03:54.9915399Z GLIBCXX_3.4.15 2025-05-07T20:03:54.9915638Z GLIBCXX_3.4.18 2025-05-07T20:03:54.9915867Z GLIBCXX_3.4.20 2025-05-07T20:03:54.9916094Z GLIBCXX_3.4.21 2025-05-07T20:03:54.9916124Z 2025-05-07T20:03:54.9916158Z 2025-05-07T20:03:54.9933967Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.N4az5zermy.symbols.txt 2025-05-07T20:03:54.9934014Z 2025-05-07T20:03:55.0343974Z 2025-05-07T20:03:55.0372887Z [CHECK] Total Number of symbols: 4395 2025-05-07T20:03:55.0401839Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:03:55.0420184Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.R2qg106AF3.usymbols.txt 2025-05-07T20:03:55.0420217Z 2025-05-07T20:03:55.0449599Z 2025-05-07T20:03:55.0475381Z [CHECK] Listing out undefined symbols (192 total): 2025-05-07T20:03:55.0499231Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.0499723Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.0500346Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.0500522Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:55.0500715Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.0500832Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.0500969Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.0501083Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:55.0501195Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:55.0501318Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:55.0501428Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:55.0501533Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.0501635Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:55.0501763Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.0501861Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:55.0502063Z U 
at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:55.0502218Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:55.0502330Z U at::RecordFunction::end() 2025-05-07T20:03:55.0502460Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:55.0502623Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:55.0502938Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:55.0503257Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:55.0503603Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:55.0503820Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:03:55.0504488Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.0504909Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:55.0505146Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.0505328Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:55.0505504Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:55.0505641Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:55.0505737Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:55.0505852Z U c10::AnyType::get() 2025-05-07T20:03:55.0506000Z U c10::BoolType::get() 2025-05-07T20:03:55.0506164Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:55.0506365Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:55.0506482Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:55.0507020Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:55.0507690Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, 
c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:55.0508076Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.0508186Z U c10::Error::what() const 2025-05-07T20:03:55.0508302Z U c10::FloatType::get() 2025-05-07T20:03:55.0508411Z U c10::GradMode::is_enabled() 2025-05-07T20:03:55.0508527Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:55.0508702Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:55.0508819Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:55.0508931Z U c10::IValue::isBoolList() const 2025-05-07T20:03:55.0509053Z U c10::IValue::isDoubleList() const 2025-05-07T20:03:55.0509167Z U c10::IValue::isIntList() const 2025-05-07T20:03:55.0509284Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:55.0509398Z U c10::IValue::isTensorList() const 2025-05-07T20:03:55.0509552Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.0509648Z U c10::IntType::get() 2025-05-07T20:03:55.0510144Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.0510330Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.0510452Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.0510582Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.0510723Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.0510949Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.0511238Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:55.0511362Z U c10::StringType::get() 2025-05-07T20:03:55.0511509Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:55.0511651Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:55.0511814Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:55.0511970Z U c10::SymFloat::operator/(c10::SymFloat 
const&) const 2025-05-07T20:03:55.0512569Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.0512723Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:55.0512900Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:55.0513069Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:55.0513205Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.0513402Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:55.0513514Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:55.0513635Z U c10::SymIntType::get() 2025-05-07T20:03:55.0513787Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:55.0513892Z U c10::TensorType::get() 2025-05-07T20:03:55.0514015Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.0514469Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.0515002Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.0515283Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.0515786Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.0516137Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.0516746Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.0517077Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:55.0517269Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:55.0517404Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.0517560Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:55.0517943Z U 
c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.0518080Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:55.0518246Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:55.0518399Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:55.0518556Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:55.0518757Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.0518880Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:55.0519156Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:55.0519453Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.0519552Z U free@GLIBC_2.2.5 2025-05-07T20:03:55.0519740Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.0519838Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:55.0519935Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.0520049Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:55.0520145Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.0520358Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.0520489Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.0520606Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:55.0520824Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:55.0521224Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.0521650Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.0522064Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0522658Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.0523065Z U std::__cxx11::basic_string, 
std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0523484Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0524035Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0524401Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0525096Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0525411Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.0525775Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.0526146Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.0526254Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:55.0526365Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.0526513Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.0526643Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.0526815Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.0526956Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:55.0527088Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:55.0527315Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.0527891Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0528012Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:55.0528126Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.0528256Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 
2025-05-07T20:03:55.0528363Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.0528472Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.0528659Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0528905Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0529022Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.0529193Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0529339Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:55.0529774Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0529920Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:55.0530021Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.0530140Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:55.0530229Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.0530355Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.0530922Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.0531375Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.0531626Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.0531742Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:55.0532033Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:55.0532210Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:55.0532401Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:55.0532587Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:55.0532921Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 
2025-05-07T20:03:55.0533063Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:55.0533263Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:55.0533434Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:55.0533548Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:55.0533669Z U torch::autograd::Node::metadata() 2025-05-07T20:03:55.0533799Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:55.0534035Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:55.0534301Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:55.0534435Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:55.0534637Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:55.0534855Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:55.0537441Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:55.0539101Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:55.0539246Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:55.0539434Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:55.0540211Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:55.0540386Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:55.0540790Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:55.0541138Z U 
torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.0541248Z U typeinfo for c10::Error 2025-05-07T20:03:55.0541396Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.0541521Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:55.0541641Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:55.0541765Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:55.0541891Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:55.0542033Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.0542182Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.0542350Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:55.0542499Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.0542595Z U vtable for c10::Error 2025-05-07T20:03:55.0542919Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.0543047Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.0543259Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.0543384Z U vtable for torch::autograd::Node 2025-05-07T20:03:55.0543552Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.0543661Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.0543773Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.0543896Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.0543987Z w __gmon_start__ 2025-05-07T20:03:55.0544093Z w __pthread_key_create 2025-05-07T20:03:55.0544199Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:55.0544306Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:55.0544448Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.0544706Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:55.0544712Z 2025-05-07T20:03:55.0550363Z linux-vdso.so.1 (0x00007ffc387f0000) 
2025-05-07T20:03:55.0550757Z libc10.so => not found 2025-05-07T20:03:55.0552278Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f392740a000) 2025-05-07T20:03:55.0552665Z libtorch.so => not found 2025-05-07T20:03:55.0553145Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f3928102000) 2025-05-07T20:03:55.0553427Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0553543Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0553717Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f39271a6000) 2025-05-07T20:03:55.0553880Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f39280d2000) 2025-05-07T20:03:55.0554071Z libc.so.6 => /lib64/libc.so.6 (0x00007f3926f9e000) 2025-05-07T20:03:55.0554250Z /lib64/ld-linux-x86-64.so.2 (0x00007f3928111000) 2025-05-07T20:03:55.0554349Z libc10.so => not found 2025-05-07T20:03:55.0554448Z libc10_cuda.so => not found 2025-05-07T20:03:55.0554832Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f3926a00000) 2025-05-07T20:03:55.0554933Z libtorch.so => not found 2025-05-07T20:03:55.0555039Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0555192Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0555293Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.0555455Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f392807a000) 2025-05-07T20:03:55.0555556Z libtorch.so => not found 2025-05-07T20:03:55.0555659Z libc10.so => not found 2025-05-07T20:03:55.0555761Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0555861Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0556052Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f3928073000) 2025-05-07T20:03:55.0556178Z libm.so.6 => /lib64/libm.so.6 (0x00007f3926925000) 2025-05-07T20:03:55.0556272Z libc10.so => not found 2025-05-07T20:03:55.0556642Z asmjit.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f39268ad000) 2025-05-07T20:03:55.0556749Z libtorch.so => not found 2025-05-07T20:03:55.0556849Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0556960Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0557090Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0557199Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0557306Z libtorch.so => not found 2025-05-07T20:03:55.0557473Z librt.so.1 => /lib64/librt.so.1 (0x00007f392806a000) 2025-05-07T20:03:55.0557481Z 2025-05-07T20:03:55.0557602Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.0557910Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:55.0557918Z 2025-05-07T20:03:55.0591334Z 2025-05-07T20:03:55.0592675Z Dynamic section at offset 0xa3d920 contains 37 entries: 2025-05-07T20:03:55.0593173Z Tag Type Name/Value 2025-05-07T20:03:55.0593769Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.0594442Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:55.0595049Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.0595696Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:55.0596296Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.0596941Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.0597536Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.0598118Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.0598710Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.0599344Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:55.0600179Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:03:55.0600738Z 
0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.0601086Z 0x000000000000000c (INIT) 0x189000 2025-05-07T20:03:55.0601209Z 0x000000000000000d (FINI) 0x8a73b8 2025-05-07T20:03:55.0601330Z 0x0000000000000019 (INIT_ARRAY) 0xa32f68 2025-05-07T20:03:55.0601485Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:03:55.0601789Z 0x000000000000001a (FINI_ARRAY) 0xa33068 2025-05-07T20:03:55.0601918Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.0602066Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:55.0602184Z 0x0000000000000005 (STRTAB) 0x20fc8 2025-05-07T20:03:55.0602355Z 0x0000000000000006 (SYMTAB) 0x73a8 2025-05-07T20:03:55.0602605Z 0x000000000000000a (STRSZ) 1247927 (bytes) 2025-05-07T20:03:55.0602753Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.0602877Z 0x0000000000000003 (PLTGOT) 0xa3ebb0 2025-05-07T20:03:55.0603019Z 0x0000000000000002 (PLTRELSZ) 42648 (bytes) 2025-05-07T20:03:55.0603159Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.0603325Z 0x0000000000000017 (JMPREL) 0x17dc38 2025-05-07T20:03:55.0603444Z 0x0000000000000007 (RELA) 0x153de8 2025-05-07T20:03:55.0603604Z 0x0000000000000008 (RELASZ) 171600 (bytes) 2025-05-07T20:03:55.0603737Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.0603843Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.0603976Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.0604125Z 0x000000006ffffffe (VERNEED) 0x153cd8 2025-05-07T20:03:55.0604239Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:55.0604369Z 0x000000006ffffff0 (VERSYM) 0x151a80 2025-05-07T20:03:55.0604501Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:03:55.0604610Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.0604634Z 2025-05-07T20:03:55.0604756Z ################################################################################ 2025-05-07T20:03:55.0604762Z 2025-05-07T20:03:55.0604766Z 2025-05-07T20:03:55.0604906Z 
################################################################################ 2025-05-07T20:03:55.0605197Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:55.0605308Z [CHECK] Listing out library size: 2025-05-07T20:03:55.0605609Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:55.0605614Z 2025-05-07T20:03:55.0605844Z 211 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:55.0606289Z 2025-05-07T20:03:55.0607615Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:55.0608173Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.0608180Z 2025-05-07T20:03:55.0997763Z GLIBC_2.2.5 2025-05-07T20:03:55.0998078Z GLIBC_2.14 2025-05-07T20:03:55.0998601Z 2025-05-07T20:03:55.0998668Z 2025-05-07T20:03:55.0999400Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:55.0999965Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.0999971Z 2025-05-07T20:03:55.1394641Z GLIBCXX_3.4 2025-05-07T20:03:55.1395340Z GLIBCXX_3.4.9 2025-05-07T20:03:55.1395953Z GLIBCXX_3.4.11 2025-05-07T20:03:55.1396578Z GLIBCXX_3.4.14 2025-05-07T20:03:55.1397212Z GLIBCXX_3.4.18 2025-05-07T20:03:55.1397787Z GLIBCXX_3.4.20 2025-05-07T20:03:55.1398384Z GLIBCXX_3.4.21 2025-05-07T20:03:55.1398742Z 2025-05-07T20:03:55.1398757Z 2025-05-07T20:03:55.1418019Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.Z327jqX7nR.symbols.txt 2025-05-07T20:03:55.1419555Z 2025-05-07T20:03:55.1774532Z 
2025-05-07T20:03:55.1802097Z [CHECK] Total Number of symbols: 5040 2025-05-07T20:03:55.1827548Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:03:55.1845413Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.06Yag5LMW2.usymbols.txt 2025-05-07T20:03:55.1846186Z 2025-05-07T20:03:55.1875009Z 2025-05-07T20:03:55.1902536Z [CHECK] Listing out undefined symbols (253 total): 2025-05-07T20:03:55.1930031Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.1931235Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.1931835Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.1932285Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.1932723Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.1933189Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.1933608Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:55.1934003Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:55.1934415Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:55.1934797Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.1935189Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:55.1935542Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.1935864Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.1936212Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.1936541Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:55.1936889Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:55.1937219Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:55.1937565Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:55.1937908Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.1938221Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:55.1938543Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.1938974Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:55.1939966Z U 
at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.1941311Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.1942594Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.1943474Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:55.1944222Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.1944963Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:55.1945538Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:55.1946557Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:55.1947733Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.1948400Z U at::detail::getCUDAHooks() 2025-05-07T20:03:55.1948718Z U at::detail::getHIPHooks() 2025-05-07T20:03:55.1949087Z U at::get_thread_num() 2025-05-07T20:03:55.1949383Z U at::globalContext() 2025-05-07T20:03:55.1949698Z U at::internal::set_thread_num(int) 2025-05-07T20:03:55.1950070Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:55.1950559Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.1951059Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.1951512Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:03:55.1952130Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:03:55.1953088Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 
2025-05-07T20:03:55.1954064Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.1955260Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.1955863Z U c10::Error::what() const 2025-05-07T20:03:55.1956192Z U c10::GradMode::is_enabled() 2025-05-07T20:03:55.1956512Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:55.1956899Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.1957343Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.1957813Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:55.1958225Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:03:55.1958579Z U c10::IValue::isTensorList() const 2025-05-07T20:03:55.1958960Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.1959410Z U c10::IntType::get() 2025-05-07T20:03:55.1960072Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.1960810Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.1961190Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.1961513Z U c10::NoneType::get() 2025-05-07T20:03:55.1961906Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.1962358Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:55.1962710Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:55.1963075Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.1963441Z U c10::StringType::get() 2025-05-07T20:03:55.1963771Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:55.1964162Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:55.1964803Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.1965425Z U c10::SymInt::guard_int(char const*, long) 
const 2025-05-07T20:03:55.1965777Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.1966131Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:55.1966818Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:55.1967451Z U c10::TensorType::get() 2025-05-07T20:03:55.1969935Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:03:55.1970929Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.1971925Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:55.1972914Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:03:55.1973373Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:55.1973738Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:55.1974083Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:55.1974413Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:55.1974730Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:55.1975063Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:55.1975504Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:55.1975960Z U c10::cuda::device_count() 2025-05-07T20:03:55.1976280Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:55.1976650Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:55.1977029Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:55.1977393Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:55.1977795Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:55.1978158Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:55.1978982Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.1980057Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 
2025-05-07T20:03:55.1981933Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.1983388Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.1984294Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.1985273Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.1986614Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.1987504Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:03:55.1987883Z U c10::get_default_dtype() 2025-05-07T20:03:55.1988378Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:55.1988992Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:55.1989428Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:55.1989777Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:55.1990188Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:55.1990895Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.1991552Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:03:55.1992055Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:03:55.1992687Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:03:55.1993210Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:55.1993621Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:55.1994042Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:03:55.1994499Z U 
c10::warn(c10::Warning const&) 2025-05-07T20:03:55.1994909Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.1995362Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:55.1995725Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.1996128Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.1996510Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:55.1996874Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:55.1997245Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:55.1997586Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:55.1997955Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.1998323Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:55.1998801Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:55.1999137Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:55.1999462Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:55.1999812Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.2000154Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:55.2001121Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2002754Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2004439Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2006090Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2007758Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, 
bool, bool, bool) 2025-05-07T20:03:55.2009476Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2011064Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:55.2012646Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:55.2014353Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2016107Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:55.2017911Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2019783Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:55.2021468Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:55.2023472Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2025437Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:55.2027148Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, 
long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:55.2028968Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2030865Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:55.2032900Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2034780Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:55.2036655Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:55.2038634Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2040595Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2042544Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2044539Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2046474Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2048418Z U 
fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2050431Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:55.2051874Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.2052263Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.2052633Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.2052991Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.2053636Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:03:55.2054297Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.2054710Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.2055081Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.2055891Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:03:55.2056968Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.2057564Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.2057843Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:55.2058113Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.2058406Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.2058738Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.2059154Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:55.2059792Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.2060588Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 
2025-05-07T20:03:55.2061453Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.2062485Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.2063470Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.2064389Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.2065327Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:55.2066384Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.2067344Z U std::__cxx11::basic_string, std::allocator >::find(char, unsigned long) const@GLIBCXX_3.4.21 2025-05-07T20:03:55.2068161Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.2068929Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.2070051Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream(std::__cxx11::basic_string, std::allocator > const&, std::_Ios_Openmode)@GLIBCXX_3.4.21 2025-05-07T20:03:55.2071228Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.2072062Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.2072737Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:55.2073324Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:55.2073695Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:55.2074048Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.2074419Z U 
std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:55.2074812Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.2075216Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.2075639Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.2076082Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:55.2076584Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.2077543Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.2078437Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:03:55.2078895Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:03:55.2079358Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:03:55.2079790Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:03:55.2080147Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:55.2080520Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.2080874Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.2081233Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.2081687Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:03:55.2082106Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.2082757Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.2083476Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:03:55.2083917Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.2084468Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.2084953Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.2085394Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.2086017Z U 
std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:55.2086518Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:55.2087059Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.2087501Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:55.2087876Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.2088198Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:55.2088501Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.2088815Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.2089679Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.2090898Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.2091748Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.2093271Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:03:55.2094875Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:03:55.2095749Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.2096643Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:03:55.2097333Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:03:55.2098024Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:03:55.2098884Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:03:55.2099739Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 
2025-05-07T20:03:55.2100309Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:03:55.2101057Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:03:55.2101753Z U typeinfo for c10::Error 2025-05-07T20:03:55.2102062Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:55.2102507Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:03:55.2102854Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:55.2103217Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:55.2103672Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.2104180Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.2104645Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.2105035Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.2105448Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.2105830Z U vtable for c10::Error 2025-05-07T20:03:55.2106343Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.2106982Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.2107410Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:55.2107745Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.2108045Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.2108342Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.2108638Z w __gmon_start__ 2025-05-07T20:03:55.2108899Z w __pthread_key_create 2025-05-07T20:03:55.2109191Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:55.2109498Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:55.2109855Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.2110291Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:55.2110607Z 2025-05-07T20:03:55.2110707Z linux-vdso.so.1 (0x00007ffd94d04000) 
2025-05-07T20:03:55.2110975Z libc10.so => not found 2025-05-07T20:03:55.2111216Z libc10_cuda.so => not found 2025-05-07T20:03:55.2111728Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f2c28800000) 2025-05-07T20:03:55.2112692Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f2c27a00000) 2025-05-07T20:03:55.2113655Z libtorch.so => not found 2025-05-07T20:03:55.2113912Z libtorch_cpu.so => not found 2025-05-07T20:03:55.2114204Z libtorch_cuda.so => not found 2025-05-07T20:03:55.2114484Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.2114840Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2c2779c000) 2025-05-07T20:03:55.2115281Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2c366e8000) 2025-05-07T20:03:55.2115670Z libc.so.6 => /lib64/libc.so.6 (0x00007f2c27594000) 2025-05-07T20:03:55.2116005Z libc10.so => not found 2025-05-07T20:03:55.2116522Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f2c3666e000) 2025-05-07T20:03:55.2117111Z libtorch.so => not found 2025-05-07T20:03:55.2117366Z libtorch_cpu.so => not found 2025-05-07T20:03:55.2117649Z libtorch_cuda.so => not found 2025-05-07T20:03:55.2117949Z libm.so.6 => /lib64/libm.so.6 (0x00007f2c36591000) 2025-05-07T20:03:55.2118332Z /lib64/ld-linux-x86-64.so.2 (0x00007f2c3671c000) 2025-05-07T20:03:55.2118675Z libtorch.so => not found 2025-05-07T20:03:55.2118924Z libc10.so => not found 2025-05-07T20:03:55.2119183Z libc10_cuda.so => not found 2025-05-07T20:03:55.2119450Z libtorch_cpu.so => not found 2025-05-07T20:03:55.2119736Z libtorch_cuda.so => not found 2025-05-07T20:03:55.2120011Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.2120358Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2c28daa000) 2025-05-07T20:03:55.2120716Z libtorch_cpu.so => not found 2025-05-07T20:03:55.2120998Z libtorch_cuda.so => not found 
2025-05-07T20:03:55.2121310Z libtorch.so => not found 2025-05-07T20:03:55.2121618Z librt.so.1 => /lib64/librt.so.1 (0x00007f2c36588000) 2025-05-07T20:03:55.2122046Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f2c36583000) 2025-05-07T20:03:55.2122522Z 2025-05-07T20:03:55.2122635Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.2123129Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:55.2123487Z 2025-05-07T20:03:55.2123551Z 2025-05-07T20:03:55.2123733Z Dynamic section at offset 0xd2d8688 contains 38 entries: 2025-05-07T20:03:55.2124116Z Tag Type Name/Value 2025-05-07T20:03:55.2124551Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.2125175Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:55.2125691Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:55.2126181Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:03:55.2126693Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.2127177Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.2127659Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.2128160Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:55.2128650Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.2129138Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.2129609Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.2130125Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:03:55.2130635Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.2131014Z 0x000000000000000c (INIT) 0x19c000 2025-05-07T20:03:55.2131336Z 0x000000000000000d (FINI) 0x73d58c 2025-05-07T20:03:55.2131652Z 0x0000000000000019 (INIT_ARRAY) 0xd2d69c0 
2025-05-07T20:03:55.2131998Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:03:55.2132328Z 0x000000000000001a (FINI_ARRAY) 0xd2d6b48 2025-05-07T20:03:55.2132662Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.2132995Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:55.2133304Z 0x0000000000000005 (STRTAB) 0x25568 2025-05-07T20:03:55.2133606Z 0x0000000000000006 (SYMTAB) 0x7cd0 2025-05-07T20:03:55.2133934Z 0x000000000000000a (STRSZ) 1383267 (bytes) 2025-05-07T20:03:55.2134285Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.2134604Z 0x0000000000000003 (PLTGOT) 0xd2d8928 2025-05-07T20:03:55.2134954Z 0x0000000000000002 (PLTRELSZ) 20640 (bytes) 2025-05-07T20:03:55.2135284Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.2135597Z 0x0000000000000017 (JMPREL) 0x196378 2025-05-07T20:03:55.2135920Z 0x0000000000000007 (RELA) 0x179950 2025-05-07T20:03:55.2136247Z 0x0000000000000008 (RELASZ) 117288 (bytes) 2025-05-07T20:03:55.2136597Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.2136900Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.2137223Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.2137547Z 0x000000006ffffffe (VERNEED) 0x179830 2025-05-07T20:03:55.2137872Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:55.2138188Z 0x000000006ffffff0 (VERSYM) 0x1770cc 2025-05-07T20:03:55.2138497Z 0x000000006ffffff9 (RELACOUNT) 447 2025-05-07T20:03:55.2138797Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.2138989Z 2025-05-07T20:03:55.2139092Z ################################################################################ 2025-05-07T20:03:55.2139352Z 2025-05-07T20:03:55.2139356Z 2025-05-07T20:03:55.2139465Z ################################################################################ 2025-05-07T20:03:55.2139960Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:55.2140487Z [CHECK] Listing out library size: 2025-05-07T20:03:55.2140976Z 
+ du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:55.2141359Z 2025-05-07T20:03:55.2141573Z 188 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:55.2141910Z 2025-05-07T20:03:55.2142305Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:55.2143331Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.2143923Z 2025-05-07T20:03:55.3000402Z GLIBC_2.2.5 2025-05-07T20:03:55.3001056Z GLIBC_2.3 2025-05-07T20:03:55.3015781Z GLIBC_2.14 2025-05-07T20:03:55.3016204Z 2025-05-07T20:03:55.3016209Z 2025-05-07T20:03:55.3016857Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:55.3018012Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.3018682Z 2025-05-07T20:03:55.3954992Z GLIBCXX_3.4 2025-05-07T20:03:55.3955658Z GLIBCXX_3.4.9 2025-05-07T20:03:55.3956276Z GLIBCXX_3.4.20 2025-05-07T20:03:55.3956855Z GLIBCXX_3.4.21 2025-05-07T20:03:55.3957212Z 2025-05-07T20:03:55.3957227Z 2025-05-07T20:03:55.3975262Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.gWFwnBJAKG.symbols.txt 2025-05-07T20:03:55.4903421Z 2025-05-07T20:03:55.4903465Z 2025-05-07T20:03:55.4946437Z [CHECK] Total Number of symbols: 12561 2025-05-07T20:03:55.4995060Z [CHECK] Number of fbgemm symbols: 5267 2025-05-07T20:03:55.5012417Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.LqlLuG3lwM.usymbols.txt 2025-05-07T20:03:55.5014052Z 2025-05-07T20:03:55.5061117Z 
2025-05-07T20:03:55.5085393Z [CHECK] Listing out undefined symbols (175 total): 2025-05-07T20:03:55.5103420Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.5105270Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.5106320Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.5107495Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.5108665Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.5109794Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:55.5110907Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:55.5111950Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:55.5113272Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.5114350Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:55.5114955Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.5115281Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.5115607Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.5115919Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:55.5116249Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:55.5116564Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:55.5116894Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:55.5117207Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.5117512Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:55.5118013Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.5118337Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:55.5118738Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:55.5119328Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:55.5119883Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:55.5120555Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:55.5121155Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:03:55.5121792Z U 
at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:03:55.5122785Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.5123710Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:55.5124197Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:55.5124648Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:55.5125088Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:55.5125509Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.5125993Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.5126395Z U c10::BoolType::get() 2025-05-07T20:03:55.5126729Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:55.5127169Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:55.5127551Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:55.5128263Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:55.5129485Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:55.5130546Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.5131119Z U c10::Error::what() const 2025-05-07T20:03:55.5131409Z U c10::FloatType::get() 2025-05-07T20:03:55.5131753Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.5132182Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.5132586Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.5132932Z U c10::IntType::get() 2025-05-07T20:03:55.5133279Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.5133669Z U 
c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.5134015Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.5134350Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.5134721Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:55.5135098Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:55.5135488Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:55.5136130Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.5136783Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:55.5137153Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:03:55.5137506Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:55.5137890Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.5138255Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:55.5138615Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:03:55.5138968Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:55.5139308Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:55.5139691Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:55.5140022Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:55.5140336Z U c10::SymIntType::get() 2025-05-07T20:03:55.5140636Z U c10::TensorType::get() 2025-05-07T20:03:55.5140936Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.5141856Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:55.5142774Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:55.5143130Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:55.5143474Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:55.5143801Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:55.5144136Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:55.5144455Z U c10::cuda::SetDevice(signed char) 
2025-05-07T20:03:55.5144909Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:55.5145345Z U c10::cuda::device_count() 2025-05-07T20:03:55.5145664Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:55.5146024Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:55.5146383Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:55.5146766Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:55.5147325Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:55.5147714Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:55.5148459Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.5149335Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.5150394Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.5151441Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.5152630Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.5153461Z U c10::get_default_dtype() 2025-05-07T20:03:55.5153788Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:55.5154137Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:55.5154697Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:55.5155338Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:55.5155814Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:55.5156162Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.5156558Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:55.5156992Z U c10::operator%(c10::SymInt const&, 
int) 2025-05-07T20:03:55.5157375Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:03:55.5157747Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:55.5158094Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:55.5158480Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:55.5158878Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:55.5159315Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:55.5159747Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:55.5160110Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:55.5160528Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.5160984Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.5161361Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.5161736Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:55.5162088Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:55.5162452Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:55.5162798Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:55.5163161Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.5163537Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:55.5163885Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:55.5164237Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:55.5164582Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:55.5164941Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.5165405Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:55.5165903Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.5166400Z U float at::Tensor::item() const 2025-05-07T20:03:55.5166743Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.5167136Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.5167481Z U free@GLIBC_2.2.5 
2025-05-07T20:03:55.5167767Z U int at::Tensor::item() const 2025-05-07T20:03:55.5168090Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.5168453Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.5168865Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.5169261Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.5169635Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.5169969Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.5170244Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.5170522Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.5170861Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.5171417Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.5172218Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.5173127Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.5174135Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.5176253Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.5177149Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.5177623Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:55.5178133Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.5178273Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.5178409Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.5178585Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 
2025-05-07T20:03:55.5178815Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.5179383Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.5179506Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:55.5179619Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.5179736Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.5179860Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.5179964Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.5180139Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.5180386Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.5180509Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.5180611Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.5180718Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.5180833Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.5181396Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.5181848Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.5182093Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.5182451Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.5182554Z U typeinfo for c10::Error 2025-05-07T20:03:55.5182698Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.5182849Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.5183012Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.5183106Z U vtable for c10::Error 2025-05-07T20:03:55.5183421Z U vtable for std::__cxx11::basic_stringbuf, std::allocator 
>@GLIBCXX_3.4.21 2025-05-07T20:03:55.5183678Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.5183844Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.5183982Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.5184093Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.5184218Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.5184305Z w __gmon_start__ 2025-05-07T20:03:55.5184404Z w __pthread_key_create 2025-05-07T20:03:55.5184543Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.5184771Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:55.5184804Z 2025-05-07T20:03:55.5184948Z linux-vdso.so.1 (0x00007ffda91fd000) 2025-05-07T20:03:55.5185033Z libc10.so => not found 2025-05-07T20:03:55.5185125Z libc10_cuda.so => not found 2025-05-07T20:03:55.5185597Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fbbd700a000) 2025-05-07T20:03:55.5185885Z libtorch.so => not found 2025-05-07T20:03:55.5185974Z libtorch_cpu.so => not found 2025-05-07T20:03:55.5186064Z libtorch_cuda.so => not found 2025-05-07T20:03:55.5186172Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.5186519Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fbbd6da6000) 2025-05-07T20:03:55.5186683Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fbbe3722000) 2025-05-07T20:03:55.5186824Z libc.so.6 => /lib64/libc.so.6 (0x00007fbbd6b9e000) 2025-05-07T20:03:55.5186954Z /lib64/ld-linux-x86-64.so.2 (0x00007fbbe3756000) 2025-05-07T20:03:55.5187040Z libc10.so => not found 2025-05-07T20:03:55.5187155Z libc10_cuda.so => not found 2025-05-07T20:03:55.5187523Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007fbbd6600000) 2025-05-07T20:03:55.5187982Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so 
(0x00007fbbe3715000) 2025-05-07T20:03:55.5188077Z libtorch.so => not found 2025-05-07T20:03:55.5188185Z libtorch_cpu.so => not found 2025-05-07T20:03:55.5188282Z libtorch_cuda.so => not found 2025-05-07T20:03:55.5188382Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.5188561Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fbbe36bd000) 2025-05-07T20:03:55.5188687Z libm.so.6 => /lib64/libm.so.6 (0x00007fbbe35e2000) 2025-05-07T20:03:55.5188775Z libc10.so => not found 2025-05-07T20:03:55.5189156Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007fbbe356a000) 2025-05-07T20:03:55.5189246Z libtorch.so => not found 2025-05-07T20:03:55.5189344Z libtorch_cpu.so => not found 2025-05-07T20:03:55.5189440Z libtorch_cuda.so => not found 2025-05-07T20:03:55.5189546Z libtorch.so => not found 2025-05-07T20:03:55.5189637Z libc10.so => not found 2025-05-07T20:03:55.5189731Z libtorch_cpu.so => not found 2025-05-07T20:03:55.5189843Z libtorch_cuda.so => not found 2025-05-07T20:03:55.5190024Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fbbe3561000) 2025-05-07T20:03:55.5190121Z libtorch_cpu.so => not found 2025-05-07T20:03:55.5190218Z libtorch_cuda.so => not found 2025-05-07T20:03:55.5190323Z libtorch.so => not found 2025-05-07T20:03:55.5190463Z librt.so.1 => /lib64/librt.so.1 (0x00007fbbd6b99000) 2025-05-07T20:03:55.5190469Z 2025-05-07T20:03:55.5190579Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.5190860Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:55.5190866Z 2025-05-07T20:03:55.5210620Z 2025-05-07T20:03:55.5211579Z Dynamic section at offset 0xbaf1f50 contains 38 entries: 2025-05-07T20:03:55.5211976Z Tag Type Name/Value 2025-05-07T20:03:55.5212428Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.5212804Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:55.5213052Z 0x0000000000000001 (NEEDED) Shared 
library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:55.5213253Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.5213525Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.5213783Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.5214012Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:55.5214220Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.5214422Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.5214678Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.5214900Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:55.5215167Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:03:55.5215367Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.5215490Z 0x000000000000000c (INIT) 0x448000 2025-05-07T20:03:55.5215617Z 0x000000000000000d (FINI) 0x1fced1c 2025-05-07T20:03:55.5215743Z 0x0000000000000019 (INIT_ARRAY) 0xbaea2f0 2025-05-07T20:03:55.5215891Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:03:55.5216017Z 0x000000000000001a (FINI_ARRAY) 0xbaea5e0 2025-05-07T20:03:55.5216141Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.5216272Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:55.5216390Z 0x0000000000000005 (STRTAB) 0x5dd10 2025-05-07T20:03:55.5216506Z 0x0000000000000006 (SYMTAB) 0x14360 2025-05-07T20:03:55.5216666Z 0x000000000000000a (STRSZ) 3688571 (bytes) 2025-05-07T20:03:55.5216793Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.5216920Z 0x0000000000000003 (PLTGOT) 0xbaf21f0 2025-05-07T20:03:55.5217056Z 0x0000000000000002 (PLTRELSZ) 14520 (bytes) 2025-05-07T20:03:55.5217187Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.5217303Z 0x0000000000000017 (JMPREL) 0x443ae8 
2025-05-07T20:03:55.5217427Z 0x0000000000000007 (RELA) 0x3e88a0 2025-05-07T20:03:55.5217581Z 0x0000000000000008 (RELASZ) 373320 (bytes) 2025-05-07T20:03:55.5217708Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.5217804Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.5217947Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.5218070Z 0x000000006ffffffe (VERNEED) 0x3e87b0 2025-05-07T20:03:55.5218180Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:55.5218305Z 0x000000006ffffff0 (VERSYM) 0x3e258c 2025-05-07T20:03:55.5218435Z 0x000000006ffffff9 (RELACOUNT) 1838 2025-05-07T20:03:55.5218543Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.5218548Z 2025-05-07T20:03:55.5218667Z ################################################################################ 2025-05-07T20:03:55.5218674Z 2025-05-07T20:03:55.5218678Z 2025-05-07T20:03:55.5218806Z ################################################################################ 2025-05-07T20:03:55.5219175Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:55.5219280Z [CHECK] Listing out library size: 2025-05-07T20:03:55.5219647Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:55.5219654Z 2025-05-07T20:03:55.5225907Z 5 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:55.5226195Z 2025-05-07T20:03:55.5226917Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:55.5227502Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.5227558Z 2025-05-07T20:03:55.5485390Z GLIBC_2.2.5 2025-05-07T20:03:55.5486635Z GLIBC_2.3 2025-05-07T20:03:55.5486919Z 
GLIBC_2.14 2025-05-07T20:03:55.5486938Z 2025-05-07T20:03:55.5486951Z 2025-05-07T20:03:55.5488468Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:55.5490431Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.5490450Z 2025-05-07T20:03:55.5746400Z GLIBCXX_3.4 2025-05-07T20:03:55.5746666Z GLIBCXX_3.4.9 2025-05-07T20:03:55.5747617Z GLIBCXX_3.4.11 2025-05-07T20:03:55.5747891Z GLIBCXX_3.4.15 2025-05-07T20:03:55.5748124Z GLIBCXX_3.4.18 2025-05-07T20:03:55.5748379Z GLIBCXX_3.4.20 2025-05-07T20:03:55.5748617Z GLIBCXX_3.4.21 2025-05-07T20:03:55.5748662Z 2025-05-07T20:03:55.5749719Z 2025-05-07T20:03:55.5769573Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.XNSN0q6kFl.symbols.txt 2025-05-07T20:03:55.5769602Z 2025-05-07T20:03:55.5983969Z 2025-05-07T20:03:55.6010677Z [CHECK] Total Number of symbols: 2987 2025-05-07T20:03:55.6030738Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:03:55.6048840Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.p6cZchxwYf.usymbols.txt 2025-05-07T20:03:55.6048893Z 2025-05-07T20:03:55.6072784Z 2025-05-07T20:03:55.6096803Z [CHECK] Listing out undefined symbols (196 total): 2025-05-07T20:03:55.6113739Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.6114885Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.6115216Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.6115588Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:55.6115887Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.6116194Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.6116523Z U 
__cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.6116853Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:55.6117152Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:55.6117469Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:55.6117789Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:55.6118077Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.6118361Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:55.6118696Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.6118979Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:55.6119440Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:55.6119640Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:55.6119775Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:55.6119889Z U at::RecordFunction::end() 2025-05-07T20:03:55.6120017Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:55.6120176Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:55.6120955Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.6121497Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:55.6122102Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.6123621Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.6124686Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:55.6125205Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:55.6125734Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:55.6126193Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:55.6126635Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor 
const&) 2025-05-07T20:03:55.6127055Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:55.6127453Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:55.6127869Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:55.6128338Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:55.6128664Z U c10::AnyType::get() 2025-05-07T20:03:55.6128988Z U c10::BoolType::get() 2025-05-07T20:03:55.6129379Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:55.6129820Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:55.6130705Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:55.6131981Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:55.6133128Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.6133739Z U c10::Error::what() const 2025-05-07T20:03:55.6134040Z U c10::FloatType::get() 2025-05-07T20:03:55.6134361Z U c10::GradMode::is_enabled() 2025-05-07T20:03:55.6134681Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:55.6135067Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:55.6135459Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:55.6135802Z U c10::IValue::isBoolList() const 2025-05-07T20:03:55.6136154Z U c10::IValue::isIntList() const 2025-05-07T20:03:55.6136487Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:55.6136849Z U c10::IValue::isTensorList() const 2025-05-07T20:03:55.6137222Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.6137596Z U c10::IntType::get() 2025-05-07T20:03:55.6138293Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.6139073Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.6139485Z U 
c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.6139846Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.6140225Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.6140682Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.6141374Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:55.6142011Z U c10::StringType::get() 2025-05-07T20:03:55.6142385Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:55.6142797Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:55.6143238Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:55.6143669Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:55.6144095Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:55.6144773Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.6145419Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:55.6145782Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:55.6146163Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:03:55.6146568Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:55.6146920Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.6147285Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:55.6147642Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:55.6148023Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:55.6148391Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:55.6148693Z U c10::SymIntType::get() 2025-05-07T20:03:55.6149024Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:55.6149339Z U c10::TensorType::get() 2025-05-07T20:03:55.6149658Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.6150287Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.6151290Z U 
c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.6152138Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.6153264Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.6154289Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.6155372Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.6156539Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:55.6157172Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:55.6157619Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.6158003Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:55.6158655Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.6159330Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:55.6159916Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:55.6160364Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:55.6160893Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:55.6161346Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.6161795Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:55.6162310Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:55.6162793Z U free@GLIBC_2.2.5 2025-05-07T20:03:55.6163185Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.6163586Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:55.6163885Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.6164162Z U 
memmove@GLIBC_2.2.5 2025-05-07T20:03:55.6164462Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.6164790Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.6165146Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.6165479Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:55.6165892Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:55.6166575Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.6167435Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.6168368Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6169460Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.6170521Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6171461Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6172888Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6173824Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6174807Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6175757Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.6176542Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.6177374Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.6177945Z U std::__throw_bad_alloc()@GLIBCXX_3.4 
2025-05-07T20:03:55.6178277Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.6178776Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.6179143Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.6179543Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.6179937Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:55.6180297Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:55.6180790Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.6181674Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.6182473Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:55.6182831Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.6183167Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.6183499Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.6183813Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.6184229Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.6184730Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.6185197Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.6185580Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6186343Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:55.6187080Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6187768Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:55.6188139Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.6188462Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:55.6188745Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.6189069Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.6189908Z U torch::Library::Library(torch::Library::Kind, 
std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.6191108Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.6191957Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.6192531Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:55.6193071Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:55.6193765Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:55.6194286Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:55.6194802Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:55.6195453Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:55.6196316Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:55.6196859Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:55.6197343Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:55.6197771Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:55.6198111Z U torch::autograd::Node::metadata() 2025-05-07T20:03:55.6198478Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:55.6198971Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:55.6199615Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:55.6200151Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:55.6200719Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:55.6201275Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:55.6205053Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, 
std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:55.6208244Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:55.6208732Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:55.6209226Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:55.6210344Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:55.6211432Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:55.6212117Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:55.6213028Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.6213621Z U typeinfo for c10::Error 2025-05-07T20:03:55.6213965Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.6214350Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:55.6214717Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:55.6215095Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:55.6215450Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:55.6215830Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.6216261Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.6216695Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:55.6217125Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.6217499Z U vtable for c10::Error 2025-05-07T20:03:55.6218046Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.6218634Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 
2025-05-07T20:03:55.6219107Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.6219572Z U vtable for torch::autograd::Node 2025-05-07T20:03:55.6219973Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.6220380Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.6220710Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.6221019Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.6221326Z w __gmon_start__ 2025-05-07T20:03:55.6221590Z w __pthread_key_create 2025-05-07T20:03:55.6222200Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:55.6222516Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:55.6222881Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.6223406Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:55.6224026Z 2025-05-07T20:03:55.6224175Z linux-vdso.so.1 (0x00007ffdc2f1d000) 2025-05-07T20:03:55.6224502Z libc10.so => not found 2025-05-07T20:03:55.6225111Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007ff1b2b88000) 2025-05-07T20:03:55.6226155Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007ff1b1c00000) 2025-05-07T20:03:55.6226985Z libtorch.so => not found 2025-05-07T20:03:55.6227229Z libtorch_cpu.so => not found 2025-05-07T20:03:55.6227498Z libtorch_cuda.so => not found 2025-05-07T20:03:55.6227817Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ff1b199c000) 2025-05-07T20:03:55.6228237Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ff1b2b58000) 2025-05-07T20:03:55.6228611Z libc.so.6 => /lib64/libc.so.6 (0x00007ff1b1794000) 2025-05-07T20:03:55.6228969Z /lib64/ld-linux-x86-64.so.2 (0x00007ff1b2b97000) 2025-05-07T20:03:55.6229292Z libtorch.so => not found 2025-05-07T20:03:55.6229521Z libc10.so => not found 2025-05-07T20:03:55.6229759Z libtorch_cpu.so => not found 
2025-05-07T20:03:55.6230013Z libtorch_cuda.so => not found 2025-05-07T20:03:55.6230328Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007ff1b2b00000) 2025-05-07T20:03:55.6230744Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007ff1b2afb000) 2025-05-07T20:03:55.6231120Z libtorch.so => not found 2025-05-07T20:03:55.6231351Z libc10.so => not found 2025-05-07T20:03:55.6231583Z libc10_cuda.so => not found 2025-05-07T20:03:55.6231836Z libtorch_cpu.so => not found 2025-05-07T20:03:55.6232096Z libtorch_cuda.so => not found 2025-05-07T20:03:55.6232359Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.6232755Z libm.so.6 => /lib64/libm.so.6 (0x00007ff1b2525000) 2025-05-07T20:03:55.6233169Z 2025-05-07T20:03:55.6233287Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.6233805Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:55.6234247Z 2025-05-07T20:03:55.6234252Z 2025-05-07T20:03:55.6234408Z Dynamic section at offset 0x4b06b0 contains 37 entries: 2025-05-07T20:03:55.6234790Z Tag Type Name/Value 2025-05-07T20:03:55.6235201Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.6235727Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:55.6236278Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:55.6236826Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.6237335Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.6237863Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.6238398Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.6238911Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.6239424Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.6239934Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 
2025-05-07T20:03:55.6240580Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:03:55.6241177Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.6241587Z 0x000000000000000c (INIT) 0xd0000 2025-05-07T20:03:55.6241928Z 0x000000000000000d (FINI) 0x3f2b18 2025-05-07T20:03:55.6242301Z 0x0000000000000019 (INIT_ARRAY) 0x4a9ff8 2025-05-07T20:03:55.6242661Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:03:55.6243007Z 0x000000000000001a (FINI_ARRAY) 0x4aa128 2025-05-07T20:03:55.6243349Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.6243717Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:55.6244045Z 0x0000000000000005 (STRTAB) 0x15da8 2025-05-07T20:03:55.6244402Z 0x0000000000000006 (SYMTAB) 0x4588 2025-05-07T20:03:55.6244764Z 0x000000000000000a (STRSZ) 609567 (bytes) 2025-05-07T20:03:55.6245250Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.6245569Z 0x0000000000000003 (PLTGOT) 0x4b1940 2025-05-07T20:03:55.6245910Z 0x0000000000000002 (PLTRELSZ) 31704 (bytes) 2025-05-07T20:03:55.6246261Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.6246575Z 0x0000000000000017 (JMPREL) 0xc7630 2025-05-07T20:03:55.6246888Z 0x0000000000000007 (RELA) 0xac330 2025-05-07T20:03:55.6247225Z 0x0000000000000008 (RELASZ) 111360 (bytes) 2025-05-07T20:03:55.6247569Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.6247878Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.6248199Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.6248532Z 0x000000006ffffffe (VERNEED) 0xac220 2025-05-07T20:03:55.6248856Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:55.6249165Z 0x000000006ffffff0 (VERSYM) 0xaaac8 2025-05-07T20:03:55.6249479Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:03:55.6249767Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.6249968Z 2025-05-07T20:03:55.6250079Z 
################################################################################ 2025-05-07T20:03:55.6250293Z 2025-05-07T20:03:55.6250297Z 2025-05-07T20:03:55.6250418Z ################################################################################ 2025-05-07T20:03:55.6250905Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:55.6251389Z [CHECK] Listing out library size: 2025-05-07T20:03:55.6251829Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:55.6252209Z 2025-05-07T20:03:55.6252410Z 18 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:55.6252716Z 2025-05-07T20:03:55.6253107Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:55.6254062Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.6254648Z 2025-05-07T20:03:55.6323238Z GLIBC_2.2.5 2025-05-07T20:03:55.6323907Z GLIBC_2.3 2025-05-07T20:03:55.6324484Z GLIBC_2.14 2025-05-07T20:03:55.6324850Z 2025-05-07T20:03:55.6324863Z 2025-05-07T20:03:55.6326182Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:55.6329370Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.6330608Z 2025-05-07T20:03:55.6439299Z GLIBCXX_3.4 2025-05-07T20:03:55.6440000Z GLIBCXX_3.4.9 2025-05-07T20:03:55.6440607Z GLIBCXX_3.4.11 2025-05-07T20:03:55.6441057Z GLIBCXX_3.4.15 2025-05-07T20:03:55.6441270Z GLIBCXX_3.4.18 2025-05-07T20:03:55.6441486Z GLIBCXX_3.4.20 2025-05-07T20:03:55.6441688Z GLIBCXX_3.4.21 2025-05-07T20:03:55.6441837Z 
2025-05-07T20:03:55.6441842Z 2025-05-07T20:03:55.6460677Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.hjZg2jgj2l.symbols.txt 2025-05-07T20:03:55.6461220Z 2025-05-07T20:03:55.6541882Z 2025-05-07T20:03:55.6565633Z [CHECK] Total Number of symbols: 1515 2025-05-07T20:03:55.6583967Z [CHECK] Number of fbgemm symbols: 211 2025-05-07T20:03:55.6598401Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.SD1FH1Rfbl.usymbols.txt 2025-05-07T20:03:55.6600227Z 2025-05-07T20:03:55.6621876Z 2025-05-07T20:03:55.6645331Z [CHECK] Listing out undefined symbols (273 total): 2025-05-07T20:03:55.6664047Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.6665246Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.6665806Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.6666257Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.6666677Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.6667075Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.6667482Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:55.6667870Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:55.6668377Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:55.6668852Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.6669215Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:55.6669535Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.6669829Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.6670141Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.6670440Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:55.6670766Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:55.6671070Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:55.6671379Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:55.6671678Z U __cxa_pure_virtual@CXXABI_1.3 
2025-05-07T20:03:55.6671989Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.6672294Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:55.6672731Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:55.6673231Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.6673554Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:55.6673981Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:55.6674362Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:55.6674786Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:55.6675211Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:55.6675573Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:55.6675961Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:55.6676406Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:55.6676833Z U at::TensorMaker::make_tensor() 2025-05-07T20:03:55.6677169Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:03:55.6677565Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:03:55.6678008Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:55.6678885Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.6680317Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.6681269Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:55.6681798Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.6682247Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:55.6682727Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:55.6684300Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:55.6684906Z U 
at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:55.6685330Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:03:55.6685931Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.6686795Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:03:55.6687298Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:55.6687841Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:55.6688517Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:55.6689595Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:55.6690542Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:55.6691010Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.6691780Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.6692973Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.6693821Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:55.6694190Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:55.6694589Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:55.6694956Z U at::globalContext() 2025-05-07T20:03:55.6695305Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:03:55.6695683Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:55.6696014Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:55.6696329Z U bool at::Tensor::item() const 2025-05-07T20:03:55.6696647Z U c10::AnyType::get() 2025-05-07T20:03:55.6697025Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:55.6697512Z U 
c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.6697931Z U c10::BoolType::get() 2025-05-07T20:03:55.6698283Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:55.6698842Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:55.6699227Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:55.6699922Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:55.6701249Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:55.6702314Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.6702944Z U c10::Error::what() const 2025-05-07T20:03:55.6703242Z U c10::GradMode::is_enabled() 2025-05-07T20:03:55.6703589Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:55.6704138Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.6704684Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:55.6705067Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:55.6705403Z U c10::IValue::isBoolList() const 2025-05-07T20:03:55.6705721Z U c10::IValue::isIntList() const 2025-05-07T20:03:55.6706089Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:55.6706599Z U c10::IValue::isTensorList() const 2025-05-07T20:03:55.6706965Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.6707332Z U c10::IntType::get() 2025-05-07T20:03:55.6708021Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.6708810Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.6709216Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.6709582Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.6709941Z U 
c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.6710453Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:55.6711018Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.6711391Z U c10::StringType::get() 2025-05-07T20:03:55.6711755Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:55.6712517Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.6713184Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:55.6713596Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.6713931Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:55.6714255Z U c10::SymIntType::get() 2025-05-07T20:03:55.6714622Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:55.6715003Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:55.6715691Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:55.6716404Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.6716788Z U c10::TensorType::get() 2025-05-07T20:03:55.6717175Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:03:55.6717602Z U c10::Type::is_module() const 2025-05-07T20:03:55.6717950Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.6719033Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:55.6719961Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:55.6720309Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:55.6720628Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:55.6720954Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:55.6721271Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:55.6721629Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:55.6722067Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 
2025-05-07T20:03:55.6722513Z U c10::cuda::device_count() 2025-05-07T20:03:55.6722863Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:55.6723240Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:55.6723605Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:55.6723967Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:55.6724348Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:55.6724737Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:55.6725353Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.6726363Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.6727204Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.6728018Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.6728914Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.6729906Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.6730818Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:55.6731456Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:55.6732003Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:55.6732415Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:55.6732730Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:55.6733233Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:55.6733828Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 
2025-05-07T20:03:55.6734238Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:55.6734649Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:55.6735026Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:55.6735341Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.6735704Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:55.6736299Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.6736898Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:55.6737267Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:55.6737645Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:55.6738013Z U c10::throwNullDataPtrError() 2025-05-07T20:03:55.6738324Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:03:55.6738644Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:55.6738940Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:55.6739333Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.6739763Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:55.6740092Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.6740611Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.6741013Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.6741373Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:55.6741745Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:55.6742089Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:55.6742423Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:55.6742763Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.6743355Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.6743724Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:55.6744097Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:55.6744447Z U cudaGetLastError@libcudart.so.11.0 
2025-05-07T20:03:55.6744797Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:55.6745132Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:55.6745492Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.6745858Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:55.6746283Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:55.6746762Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.6747117Z U free@GLIBC_2.2.5 2025-05-07T20:03:55.6747453Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.6747811Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:55.6748108Z U long at::Tensor::item() const 2025-05-07T20:03:55.6748515Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.6749044Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.6749434Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.6749886Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:55.6750160Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.6750420Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:55.6750691Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.6750980Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.6751299Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.6751614Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:55.6751990Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:55.6752715Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.6753795Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.6754720Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6755814Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.6756883Z U std::__cxx11::basic_string, std::allocator 
>::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6757812Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6758808Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:55.6759973Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6761242Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6762272Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.6763146Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.6763756Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:55.6764105Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.6764466Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.6764865Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.6765300Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.6765723Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:55.6766120Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:55.6766611Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.6767574Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.6768413Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:55.6768779Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.6769253Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.6769595Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.6769925Z U 
std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.6770332Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.6770860Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.6771340Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.6771733Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6772145Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:55.6772567Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6773286Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.6773956Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:55.6774310Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.6774612Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:55.6774897Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.6775199Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.6776020Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.6777202Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.6778024Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.6778565Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:55.6779086Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:55.6779740Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:55.6780276Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:55.6780774Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:55.6781442Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 
2025-05-07T20:03:55.6782081Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:55.6782535Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:55.6783018Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:55.6783430Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:55.6783775Z U torch::autograd::Node::metadata() 2025-05-07T20:03:55.6784127Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:55.6784634Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:55.6785269Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:55.6786209Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:55.6786443Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:55.6786675Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:55.6789503Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:55.6789676Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:55.6789829Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:55.6790011Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:55.6790170Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:55.6790587Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:55.6790976Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.6791555Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, 
std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:55.6791675Z U typeinfo for c10::Error 2025-05-07T20:03:55.6791787Z U typeinfo for c10::Type 2025-05-07T20:03:55.6791935Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.6792068Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:55.6792299Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:55.6792493Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:55.6792654Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.6792878Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.6793079Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:55.6793236Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.6793472Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.6793575Z U vtable for c10::Error 2025-05-07T20:03:55.6793715Z U vtable for c10::ListType 2025-05-07T20:03:55.6794065Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.6794204Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.6794432Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.6794572Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:55.6794693Z U vtable for torch::autograd::Node 2025-05-07T20:03:55.6794873Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.6794996Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.6795104Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.6795209Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.6795494Z w __gmon_start__ 2025-05-07T20:03:55.6795602Z w __pthread_key_create 2025-05-07T20:03:55.6795715Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:55.6795823Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:55.6795977Z [CHECK] Listing out external shared libraries linked: 
2025-05-07T20:03:55.6796203Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:55.6796211Z 2025-05-07T20:03:55.6796364Z linux-vdso.so.1 (0x00007ffc8a6ed000) 2025-05-07T20:03:55.6796457Z libc10.so => not found 2025-05-07T20:03:55.6796552Z libc10_cuda.so => not found 2025-05-07T20:03:55.6797126Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007ffad3708000) 2025-05-07T20:03:55.6797614Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007ffad2e00000) 2025-05-07T20:03:55.6797714Z libtorch.so => not found 2025-05-07T20:03:55.6797812Z libtorch_cpu.so => not found 2025-05-07T20:03:55.6797921Z libtorch_cuda.so => not found 2025-05-07T20:03:55.6798019Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.6798185Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007ffad2b9c000) 2025-05-07T20:03:55.6798322Z libm.so.6 => /lib64/libm.so.6 (0x00007ffad2ac1000) 2025-05-07T20:03:55.6798476Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007ffad4b93000) 2025-05-07T20:03:55.6798601Z libc.so.6 => /lib64/libc.so.6 (0x00007ffad28b9000) 2025-05-07T20:03:55.6798733Z /lib64/ld-linux-x86-64.so.2 (0x00007ffad4bc7000) 2025-05-07T20:03:55.6798830Z libc10.so => not found 2025-05-07T20:03:55.6798924Z libc10_cuda.so => not found 2025-05-07T20:03:55.6799013Z libtorch.so => not found 2025-05-07T20:03:55.6799120Z libtorch_cpu.so => not found 2025-05-07T20:03:55.6799214Z libtorch_cuda.so => not found 2025-05-07T20:03:55.6799312Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.6799465Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007ffad4b39000) 2025-05-07T20:03:55.6799567Z libtorch.so => not found 2025-05-07T20:03:55.6799655Z libc10.so => not found 2025-05-07T20:03:55.6799753Z libc10_cuda.so => not found 2025-05-07T20:03:55.6799888Z libtorch_cpu.so => not found 2025-05-07T20:03:55.6799987Z 
libtorch_cuda.so => not found 2025-05-07T20:03:55.6800084Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.6800089Z 2025-05-07T20:03:55.6800201Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.6800459Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:55.6800497Z 2025-05-07T20:03:55.6800500Z 2025-05-07T20:03:55.6800696Z Dynamic section at offset 0x11af470 contains 40 entries: 2025-05-07T20:03:55.6800825Z Tag Type Name/Value 2025-05-07T20:03:55.6801022Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.6801227Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:55.6801516Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:55.6801845Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:55.6802049Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.6802252Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.6802468Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.6802679Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:55.6802886Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.6803084Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:55.6803282Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.6803475Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.6803705Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:55.6803950Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:03:55.6804135Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.6804248Z 0x000000000000000c (INIT) 0x53000 2025-05-07T20:03:55.6804380Z 0x000000000000000d (FINI) 0x14c8cc 
2025-05-07T20:03:55.6804502Z 0x0000000000000019 (INIT_ARRAY) 0x11ae010 2025-05-07T20:03:55.6804630Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:03:55.6804772Z 0x000000000000001a (FINI_ARRAY) 0x11ae0a0 2025-05-07T20:03:55.6805003Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.6805110Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:55.6805229Z 0x0000000000000005 (STRTAB) 0xb768 2025-05-07T20:03:55.6805329Z 0x0000000000000006 (SYMTAB) 0x2948 2025-05-07T20:03:55.6805453Z 0x000000000000000a (STRSZ) 240496 (bytes) 2025-05-07T20:03:55.6805565Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.6805688Z 0x0000000000000003 (PLTGOT) 0x11af730 2025-05-07T20:03:55.6805813Z 0x0000000000000002 (PLTRELSZ) 16896 (bytes) 2025-05-07T20:03:55.6805912Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.6806031Z 0x0000000000000017 (JMPREL) 0x4e360 2025-05-07T20:03:55.6806135Z 0x0000000000000007 (RELA) 0x47010 2025-05-07T20:03:55.6806259Z 0x0000000000000008 (RELASZ) 29520 (bytes) 2025-05-07T20:03:55.6806374Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.6806475Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.6806590Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.6806700Z 0x000000006ffffffe (VERNEED) 0x46eb0 2025-05-07T20:03:55.6806808Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:55.6806920Z 0x000000006ffffff0 (VERSYM) 0x462d8 2025-05-07T20:03:55.6807020Z 0x000000006ffffff9 (RELACOUNT) 213 2025-05-07T20:03:55.6807123Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.6807162Z 2025-05-07T20:03:55.6807270Z ################################################################################ 2025-05-07T20:03:55.6807274Z 2025-05-07T20:03:55.6807277Z 2025-05-07T20:03:55.6807383Z ################################################################################ 2025-05-07T20:03:55.6807716Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 
2025-05-07T20:03:55.6807841Z [CHECK] Listing out library size: 2025-05-07T20:03:55.6808127Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:55.6808131Z 2025-05-07T20:03:55.6808354Z 1 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:55.6808357Z 2025-05-07T20:03:55.6808786Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:55.6809287Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.6809292Z 2025-05-07T20:03:55.6814182Z GLIBC_2.2.5 2025-05-07T20:03:55.6814261Z GLIBC_2.3 2025-05-07T20:03:55.6814517Z GLIBC_2.14 2025-05-07T20:03:55.6814632Z 2025-05-07T20:03:55.6814747Z 2025-05-07T20:03:55.6815361Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:55.6815925Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.6815938Z 2025-05-07T20:03:55.6875737Z GLIBCXX_3.4 2025-05-07T20:03:55.6876012Z GLIBCXX_3.4.9 2025-05-07T20:03:55.6876274Z GLIBCXX_3.4.18 2025-05-07T20:03:55.6876507Z GLIBCXX_3.4.20 2025-05-07T20:03:55.6876753Z GLIBCXX_3.4.21 2025-05-07T20:03:55.6876769Z 2025-05-07T20:03:55.6876804Z 2025-05-07T20:03:55.6895763Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.TcPBXdYQzY.symbols.txt 2025-05-07T20:03:55.6895792Z 2025-05-07T20:03:55.6921863Z 2025-05-07T20:03:55.6947907Z [CHECK] Total Number of symbols: 349 2025-05-07T20:03:55.6967902Z [CHECK] Number of fbgemm symbols: 57 2025-05-07T20:03:55.6987992Z + nm -gDCu 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.o0JNMw23Qv.usymbols.txt 2025-05-07T20:03:55.6988008Z 2025-05-07T20:03:55.7016254Z 2025-05-07T20:03:55.7045812Z [CHECK] Listing out undefined symbols (123 total): 2025-05-07T20:03:55.7063134Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.7064610Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.7064931Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.7065378Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.7065831Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.7066227Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.7066643Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:55.7067045Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:55.7067403Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:55.7067807Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.7068097Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.7068417Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.7068725Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.7069024Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:55.7069342Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:55.7072062Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:55.7072509Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.7072616Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:55.7072735Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.7073088Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:55.7073756Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.7074493Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.7074669Z U 
c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:55.7074828Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.7074933Z U c10::IntType::get() 2025-05-07T20:03:55.7075109Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.7075253Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.7075486Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.7075910Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.7076069Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:55.7076191Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.7076298Z U c10::TensorType::get() 2025-05-07T20:03:55.7076426Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.7077185Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:55.7077329Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:55.7077473Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:55.7077599Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:55.7077719Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:55.7077847Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:55.7077974Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:55.7078239Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:55.7078352Z U c10::cuda::device_count() 2025-05-07T20:03:55.7078511Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:55.7078651Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:55.7078798Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:55.7078958Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:55.7079230Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:55.7079341Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:55.7079857Z U 
c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.7080102Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.7080582Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.7080923Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.7081513Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.7081673Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:55.7081807Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:55.7081925Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:55.7082061Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:55.7082206Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:55.7082314Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:55.7082528Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.7082673Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.7082810Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.7082931Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:55.7083074Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:55.7083189Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:55.7083307Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:55.7083453Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.7083588Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:55.7083709Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:55.7083824Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:55.7083949Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:55.7084061Z U cudaStreamQuery@libcudart.so.11.0 
2025-05-07T20:03:55.7084186Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.7084323Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:55.7084459Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.7084626Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.7084783Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.7084875Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.7084963Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.7085071Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.7085192Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.7085517Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.7086264Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.7086776Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.7087338Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.7110039Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.7110592Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.7111258Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:55.7111938Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.7112276Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.7112900Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.7113027Z U 
std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:55.7113148Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.7113312Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.7113496Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.7113678Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.7113936Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.7114534Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.7114663Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.7114803Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.7114921Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.7115037Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.7115232Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.7115474Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.7115611Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.7115733Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.7115831Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.7115954Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.7116568Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.7117040Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.7117298Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.7117684Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.7117904Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.7118059Z U vtable for 
__cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.7118233Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.7118390Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.7118732Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.7119089Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.7119196Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.7119297Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.7119407Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.7119500Z w __gmon_start__ 2025-05-07T20:03:55.7119593Z w __pthread_key_create 2025-05-07T20:03:55.7119745Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.7120015Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:55.7120022Z 2025-05-07T20:03:55.7120153Z linux-vdso.so.1 (0x00007ffd4ff5e000) 2025-05-07T20:03:55.7120250Z libtorch.so => not found 2025-05-07T20:03:55.7120363Z libc10.so => not found 2025-05-07T20:03:55.7120456Z libc10_cuda.so => not found 2025-05-07T20:03:55.7120545Z libtorch_cpu.so => not found 2025-05-07T20:03:55.7120672Z libtorch_cuda.so => not found 2025-05-07T20:03:55.7120768Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.7120925Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f0a570aa000) 2025-05-07T20:03:55.7121078Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f0a57054000) 2025-05-07T20:03:55.7121221Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0a57026000) 2025-05-07T20:03:55.7121374Z libc.so.6 => /lib64/libc.so.6 (0x00007f0a56e1e000) 2025-05-07T20:03:55.7121494Z /lib64/ld-linux-x86-64.so.2 (0x00007f0a57367000) 2025-05-07T20:03:55.7121623Z libm.so.6 => /lib64/libm.so.6 (0x00007f0a56d43000) 2025-05-07T20:03:55.7121628Z 2025-05-07T20:03:55.7121732Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.7122002Z + readelf -d 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:55.7122009Z 2025-05-07T20:03:55.7136001Z 2025-05-07T20:03:55.7137068Z Dynamic section at offset 0x50440 contains 37 entries: 2025-05-07T20:03:55.7137443Z Tag Type Name/Value 2025-05-07T20:03:55.7138044Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.7138629Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.7139215Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:55.7139815Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.7140426Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.7141048Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:55.7141631Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.7142216Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:55.7142701Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.7142897Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.7143225Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:55.7143484Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:03:55.7143597Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:03:55.7143702Z 0x000000000000000d (FINI) 0x2fa7c 2025-05-07T20:03:55.7143818Z 0x0000000000000019 (INIT_ARRAY) 0x50bf8 2025-05-07T20:03:55.7144049Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:03:55.7144154Z 0x000000000000001a (FINI_ARRAY) 0x50c20 2025-05-07T20:03:55.7144271Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.7144375Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:55.7144482Z 0x0000000000000005 (STRTAB) 0x2e30 2025-05-07T20:03:55.7144610Z 0x0000000000000006 (SYMTAB) 0xd60 2025-05-07T20:03:55.7144733Z 
0x000000000000000a (STRSZ) 35916 (bytes) 2025-05-07T20:03:55.7144845Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.7144960Z 0x0000000000000003 (PLTGOT) 0x516e0 2025-05-07T20:03:55.7145083Z 0x0000000000000002 (PLTRELSZ) 5544 (bytes) 2025-05-07T20:03:55.7145183Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.7145286Z 0x0000000000000017 (JMPREL) 0xdc00 2025-05-07T20:03:55.7145392Z 0x0000000000000007 (RELA) 0xbe48 2025-05-07T20:03:55.7145513Z 0x0000000000000008 (RELASZ) 7608 (bytes) 2025-05-07T20:03:55.7145758Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.7145862Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.7145983Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.7146135Z 0x000000006ffffffe (VERNEED) 0xbd38 2025-05-07T20:03:55.7146235Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:55.7146390Z 0x000000006ffffff0 (VERSYM) 0xba7c 2025-05-07T20:03:55.7146493Z 0x000000006ffffff9 (RELACOUNT) 152 2025-05-07T20:03:55.7146582Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.7146589Z 2025-05-07T20:03:55.7146704Z ################################################################################ 2025-05-07T20:03:55.7146708Z 2025-05-07T20:03:55.7146712Z 2025-05-07T20:03:55.7146849Z ################################################################################ 2025-05-07T20:03:55.7147069Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:55.7147181Z [CHECK] Listing out library size: 2025-05-07T20:03:55.7147572Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:55.7147578Z 2025-05-07T20:03:55.7148003Z 40 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:55.7148797Z 2025-05-07T20:03:55.7149759Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:55.7150234Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so | 
grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.7150239Z 2025-05-07T20:03:55.7546208Z GLIBC_2.2.5 2025-05-07T20:03:55.7546456Z GLIBC_2.3 2025-05-07T20:03:55.7546685Z GLIBC_2.14 2025-05-07T20:03:55.7546701Z 2025-05-07T20:03:55.7546741Z 2025-05-07T20:03:55.7547899Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:55.7549365Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.7549382Z 2025-05-07T20:03:55.7937786Z GLIBCXX_3.4 2025-05-07T20:03:55.7938316Z GLIBCXX_3.4.9 2025-05-07T20:03:55.7938594Z GLIBCXX_3.4.11 2025-05-07T20:03:55.7938856Z GLIBCXX_3.4.14 2025-05-07T20:03:55.7939102Z GLIBCXX_3.4.15 2025-05-07T20:03:55.7939342Z GLIBCXX_3.4.18 2025-05-07T20:03:55.7939568Z GLIBCXX_3.4.19 2025-05-07T20:03:55.7939815Z GLIBCXX_3.4.20 2025-05-07T20:03:55.7940043Z GLIBCXX_3.4.21 2025-05-07T20:03:55.7940192Z 2025-05-07T20:03:55.7940196Z 2025-05-07T20:03:55.7956934Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.7QZgCKRspc.symbols.txt 2025-05-07T20:03:55.7956976Z 2025-05-07T20:03:55.8279208Z 2025-05-07T20:03:55.8308162Z [CHECK] Total Number of symbols: 6602 2025-05-07T20:03:55.8330524Z [CHECK] Number of fbgemm symbols: 4516 2025-05-07T20:03:55.8347407Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.XAKRv9OS1U.usymbols.txt 2025-05-07T20:03:55.8347451Z 2025-05-07T20:03:55.8380898Z 2025-05-07T20:03:55.8403691Z [CHECK] Listing out undefined symbols (472 total): 2025-05-07T20:03:55.8421585Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.8422647Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.8422892Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.8423022Z U __assert_fail@GLIBC_2.2.5 
2025-05-07T20:03:55.8423200Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.8423368Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.8423507Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.8423663Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:55.8424036Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:55.8424159Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:55.8424303Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.8424541Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:55.8424694Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.8424818Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.8424939Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.8425051Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:55.8425224Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:55.8425365Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:55.8425527Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:55.8425652Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:03:55.8425757Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.8425898Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:55.8425997Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:55.8426106Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.8426234Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:03:55.8426336Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:55.8426523Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:55.8426669Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:55.8426794Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:55.8426944Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:55.8427066Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:03:55.8427199Z U at::SplitUntil32Bit::end() const 2025-05-07T20:03:55.8427352Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:03:55.8427494Z U 
at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:03:55.8427732Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:03:55.8427926Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:55.8428107Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:03:55.8428284Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:03:55.8428423Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:03:55.8428566Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:03:55.8428700Z U at::TensorIteratorBase::numel() const 2025-05-07T20:03:55.8428858Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:03:55.8429077Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:03:55.8429312Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:03:55.8429428Z U at::TensorMaker::make_tensor() 2025-05-07T20:03:55.8429569Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:03:55.8429730Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:03:55.8429973Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:55.8430199Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:55.8430334Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:03:55.8430688Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:03:55.8430911Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:55.8431124Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:03:55.8431337Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:03:55.8431550Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:55.8431810Z U at::_ops::clone::call(at::Tensor 
const&, std::optional) 2025-05-07T20:03:55.8431996Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:03:55.8432160Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:55.8432593Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:03:55.8433071Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:03:55.8433258Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:55.8433881Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.8434550Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.8434732Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:55.8434921Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:55.8435047Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:03:55.8435610Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.8435790Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:55.8436106Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:03:55.8436331Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:55.8436457Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:03:55.8436627Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.8436760Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:03:55.8436941Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.8437530Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, 
std::optional, std::optional, std::optional) 2025-05-07T20:03:55.8437725Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:55.8438267Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.8438465Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:55.8438784Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:55.8439086Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:55.8439532Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.8439931Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:55.8440083Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:03:55.8440367Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:55.8440548Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:03:55.8440786Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:55.8440978Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:03:55.8441311Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:55.8441728Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:55.8442347Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:55.8442515Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:03:55.8442777Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:03:55.8442914Z U at::_ops::unsqueeze::call(at::Tensor const&, 
long) 2025-05-07T20:03:55.8443078Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:55.8443242Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.8443355Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:55.8443811Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.8444379Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.8444645Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:03:55.8444771Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:03:55.8444913Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:55.8445044Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:03:55.8445185Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:03:55.8445526Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:03:55.8445650Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:55.8445796Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:55.8445909Z U at::get_num_threads() 2025-05-07T20:03:55.8446009Z U at::get_thread_num() 2025-05-07T20:03:55.8446216Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:03:55.8446350Z U at::internal::set_thread_num(int) 2025-05-07T20:03:55.8446580Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:03:55.8447134Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.8447752Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.8448034Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 
2025-05-07T20:03:55.8448197Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:03:55.8448335Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:55.8448515Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:03:55.8448607Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:55.8448730Z U bool at::Tensor::item() const 2025-05-07T20:03:55.8448857Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.8449023Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8449129Z U c10::AnyType::get() 2025-05-07T20:03:55.8449284Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:55.8449457Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.8449654Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8449748Z U c10::BoolType::get() 2025-05-07T20:03:55.8449848Z U c10::DeviceObjType::get() 2025-05-07T20:03:55.8449995Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:55.8450165Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:55.8450269Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:55.8450761Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:55.8451366Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:55.8451717Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.8451818Z U c10::Error::what() const 2025-05-07T20:03:55.8451906Z U c10::FloatType::get() 2025-05-07T20:03:55.8452003Z U c10::GradMode::is_enabled() 2025-05-07T20:03:55.8452104Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:55.8452250Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.8452411Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8452563Z U 
c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:55.8452688Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:55.8452790Z U c10::IValue::isBoolList() const 2025-05-07T20:03:55.8452890Z U c10::IValue::isIntList() const 2025-05-07T20:03:55.8453000Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:55.8453103Z U c10::IValue::isTensorList() const 2025-05-07T20:03:55.8453237Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.8453342Z U c10::InferenceMode::is_enabled() 2025-05-07T20:03:55.8453432Z U c10::IntType::get() 2025-05-07T20:03:55.8453885Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.8454050Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.8454167Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.8454289Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.8455915Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.8456124Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.8456247Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:55.8456568Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:55.8456673Z U c10::ScalarTypeType::get() 2025-05-07T20:03:55.8456974Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:55.8457292Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:03:55.8457444Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.8457574Z U c10::StringType::get() 2025-05-07T20:03:55.8457719Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:55.8457855Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:55.8457994Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:55.8458395Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.8458527Z U 
c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:55.8458662Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:03:55.8458805Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:03:55.8458931Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:55.8459041Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.8459167Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:55.8459303Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:55.8459411Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:55.8459511Z U c10::SymIntType::get() 2025-05-07T20:03:55.8459659Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:55.8459775Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:55.8460197Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:55.8460363Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.8460459Z U c10::TensorType::get() 2025-05-07T20:03:55.8461219Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:03:55.8461418Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:03:55.8461523Z U c10::Type::is_module() const 2025-05-07T20:03:55.8461637Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.8462334Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:55.8462462Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:55.8462632Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:03:55.8462885Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:03:55.8463211Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:03:55.8463335Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:55.8463478Z U 
c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:55.8463585Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:55.8463698Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:55.8463813Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:55.8464079Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:55.8464213Z U c10::cuda::current_device() 2025-05-07T20:03:55.8464317Z U c10::cuda::device_count() 2025-05-07T20:03:55.8464451Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:55.8464582Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:55.8464753Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:55.8464887Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:55.8465040Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:55.8465164Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:55.8465576Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.8466073Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.8466503Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.8466999Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.8467350Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.8468122Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.8468404Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:55.8468702Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:55.8468911Z U 
c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:55.8469030Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:55.8469153Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:55.8469489Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:55.8469678Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:55.8469822Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:03:55.8469954Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:03:55.8470104Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:55.8470287Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:55.8470416Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:55.8470540Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.8470695Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:55.8471092Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.8471220Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:03:55.8471347Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:03:55.8471498Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:03:55.8471657Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:55.8471801Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:03:55.8471940Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:03:55.8472088Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:55.8472281Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:55.8472564Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:55.8472732Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:55.8472880Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:55.8473051Z U c10::operator==(c10::SymInt const&, int) 
2025-05-07T20:03:55.8473174Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:55.8473303Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:55.8473436Z U c10::report_overflow(char const*) 2025-05-07T20:03:55.8473621Z U c10::throwNullDataPtrError() 2025-05-07T20:03:55.8473744Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:03:55.8473867Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:55.8473985Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:55.8474188Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.8474307Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:55.8474433Z U cublasGemmStridedBatchedEx 2025-05-07T20:03:55.8474536Z U cublasSetStream_v2 2025-05-07T20:03:55.8474670Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.8474822Z U cudaDeviceGetByPCIBusId@libcudart.so.11.0 2025-05-07T20:03:55.8474953Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.8475093Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.8475231Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:55.8475362Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:55.8475484Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:55.8475609Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:55.8475752Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.8475863Z U cudaFree@libcudart.so.11.0 2025-05-07T20:03:55.8475994Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:55.8476135Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.8476254Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:55.8476381Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:55.8476533Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:55.8476660Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:55.8476780Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:55.8476922Z U cudaHostGetDevicePointer@libcudart.so.11.0 2025-05-07T20:03:55.8477057Z U 
cudaHostRegister@libcudart.so.11.0 2025-05-07T20:03:55.8477182Z U cudaHostUnregister@libcudart.so.11.0 2025-05-07T20:03:55.8477307Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:55.8477447Z U cudaMallocManaged@libcudart.so.11.0 2025-05-07T20:03:55.8477564Z U cudaMemAdvise@libcudart.so.11.0 2025-05-07T20:03:55.8477697Z U cudaMemPrefetchAsync@libcudart.so.11.0 2025-05-07T20:03:55.8477828Z U cudaMemcpy2DAsync@libcudart.so.11.0 2025-05-07T20:03:55.8477948Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:55.8478250Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.8478411Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:55.8478541Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:55.8478665Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:55.8478828Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.8479089Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:55.8479257Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.8479418Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8479528Z U exit@GLIBC_2.2.5 2025-05-07T20:03:55.8479624Z U exp10@GLIBC_2.2.5 2025-05-07T20:03:55.8479716Z U exp2@GLIBC_2.2.5 2025-05-07T20:03:55.8479832Z U exp@GLIBC_2.2.5 2025-05-07T20:03:55.8479939Z U expf@GLIBC_2.2.5 2025-05-07T20:03:55.8480130Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:55.8480320Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:55.8480527Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:55.8480719Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:55.8480914Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:55.8481064Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.8481216Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8481309Z U 
fmod@GLIBC_2.2.5 2025-05-07T20:03:55.8481411Z U free@GLIBC_2.2.5 2025-05-07T20:03:55.8481525Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:03:55.8481634Z U int at::Tensor::item() const 2025-05-07T20:03:55.8481805Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:03:55.8481931Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.8482073Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8482167Z U isnan@GLIBC_2.2.5 2025-05-07T20:03:55.8482280Z U lgamma@GLIBC_2.2.5 2025-05-07T20:03:55.8482377Z U llrint@GLIBC_2.2.5 2025-05-07T20:03:55.8482469Z U llround@GLIBC_2.2.5 2025-05-07T20:03:55.8482573Z U log10@GLIBC_2.2.5 2025-05-07T20:03:55.8482665Z U log2@GLIBC_2.2.5 2025-05-07T20:03:55.8482754Z U log@GLIBC_2.2.5 2025-05-07T20:03:55.8482843Z U logl@GLIBC_2.2.5 2025-05-07T20:03:55.8482968Z U long at::Tensor::item() const 2025-05-07T20:03:55.8483135Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.8483299Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:03:55.8483436Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.8483576Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8483669Z U lrint@GLIBC_2.2.5 2025-05-07T20:03:55.8483775Z U madvise@GLIBC_2.2.5 2025-05-07T20:03:55.8483866Z U malloc@GLIBC_2.2.5 2025-05-07T20:03:55.8483954Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:55.8484051Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.8484138Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:55.8484226Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.8484324Z U nextafter@GLIBC_2.2.5 2025-05-07T20:03:55.8484431Z U nvmlDeviceGetCount_v2 2025-05-07T20:03:55.8484540Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:03:55.8484662Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:03:55.8484802Z U nvmlDeviceGetNvLinkState 2025-05-07T20:03:55.8484901Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:03:55.8484988Z U nvmlInit_v2 2025-05-07T20:03:55.8485098Z U operator delete(void*)@GLIBCXX_3.4 
2025-05-07T20:03:55.8485255Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.8485399Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.8485493Z U pow@GLIBC_2.2.5 2025-05-07T20:03:55.8485596Z U printf@GLIBC_2.2.5 2025-05-07T20:03:55.8486316Z U puts@GLIBC_2.2.5 2025-05-07T20:03:55.8486521Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:55.8486803Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8487011Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8487107Z U sin@GLIBC_2.2.5 2025-05-07T20:03:55.8487343Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:55.8487520Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:55.8487715Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:03:55.8487892Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:55.8488302Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:55.8488655Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.8489077Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.8489495Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.8490051Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.8490465Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.8490885Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.8491367Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned 
long) 2025-05-07T20:03:55.8491904Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.8492476Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.8492932Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.8493330Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.8493713Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.8493836Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:03:55.8493959Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:03:55.8494156Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:55.8494281Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.8494396Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:03:55.8494539Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:55.8494747Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.8494935Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.8495080Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.8495266Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.8495404Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:55.8495574Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:55.8495799Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:03:55.8496158Z U std::basic_ifstream >::basic_ifstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:03:55.8496408Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:03:55.8496668Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.8497020Z U std::basic_ofstream 
>::basic_ofstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:03:55.8497269Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:03:55.8497884Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.8498041Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:03:55.8498150Z U std::cout@GLIBCXX_3.4 2025-05-07T20:03:55.8498325Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:03:55.8498458Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:55.8498584Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.8498838Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.8498949Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.8499052Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.8499244Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:03:55.8499412Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.8499630Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.8499748Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:03:55.8499871Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.8499985Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:03:55.8500127Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:03:55.8500284Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.8500413Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:55.8500589Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.8500996Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.8501137Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:55.8501255Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.8501376Z U strcmp@GLIBC_2.2.5 
2025-05-07T20:03:55.8501470Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.8501564Z U sysconf@GLIBC_2.2.5 2025-05-07T20:03:55.8501691Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.8502338Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.8502788Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.8503295Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:03:55.8503539Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.8503676Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:55.8503955Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:55.8504130Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:55.8504333Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:55.8504511Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:55.8504847Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:55.8505002Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:55.8505177Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:55.8505342Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:55.8505466Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:55.8505572Z U torch::autograd::Node::metadata() 2025-05-07T20:03:55.8505698Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:55.8505941Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:55.8506195Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:55.8506327Z U 
torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:55.8506543Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:55.8506749Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:55.8509313Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:55.8509465Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:55.8509612Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:55.8509770Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:55.8509945Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:55.8510332Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:55.8510715Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.8511133Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.8511337Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:03:55.8511454Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:03:55.8512211Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:55.8512331Z U typeinfo for c10::Error 2025-05-07T20:03:55.8512527Z U typeinfo for c10::Type 2025-05-07T20:03:55.8512671Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.8512805Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:55.8512931Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:55.8513247Z U typeinfo for 
std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:55.8513367Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:55.8513587Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.8513804Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.8514268Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:55.8514804Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:55.8515263Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:55.8515814Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:55.8516262Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:03:55.8516793Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:03:55.8517287Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:03:55.8517847Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:55.8518390Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:03:55.8518997Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:55.8519595Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:55.8519762Z U vtable for 
__cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.8519923Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.8520118Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:55.8520293Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.8520455Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.8520603Z U vtable for at::TensorIterator 2025-05-07T20:03:55.8520733Z U vtable for at::TensorIteratorBase 2025-05-07T20:03:55.8520863Z U vtable for c10::Error 2025-05-07T20:03:55.8520969Z U vtable for c10::ListType 2025-05-07T20:03:55.8521319Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.8521458Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.8521715Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.8521870Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:55.8521994Z U vtable for torch::autograd::Node 2025-05-07T20:03:55.8522179Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.8522303Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.8522416Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.8522524Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.8522624Z w __gmon_start__ 2025-05-07T20:03:55.8522740Z w __pthread_key_create 2025-05-07T20:03:55.8522859Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:55.8522979Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:55.8523083Z w pthread_once 2025-05-07T20:03:55.8523239Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.8523421Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:55.8523429Z 2025-05-07T20:03:55.8523605Z linux-vdso.so.1 (0x00007ffe75bfe000) 2025-05-07T20:03:55.8523705Z libc10.so => not found 2025-05-07T20:03:55.8523802Z libc10_cuda.so => not found 2025-05-07T20:03:55.8524196Z fbgemm.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007fe378e00000) 2025-05-07T20:03:55.8524305Z libnvidia-ml.so.1 => not found 2025-05-07T20:03:55.8524405Z libtorch.so => not found 2025-05-07T20:03:55.8524992Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fe378d08000) 2025-05-07T20:03:55.8525468Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fe378400000) 2025-05-07T20:03:55.8525576Z libtorch_cpu.so => not found 2025-05-07T20:03:55.8525691Z libtorch_cuda.so => not found 2025-05-07T20:03:55.8525789Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.8525956Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fe37819c000) 2025-05-07T20:03:55.8526087Z libm.so.6 => /lib64/libm.so.6 (0x00007fe3780c1000) 2025-05-07T20:03:55.8526260Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe37bffa000) 2025-05-07T20:03:55.8526506Z libc.so.6 => /lib64/libc.so.6 (0x00007fe377eb9000) 2025-05-07T20:03:55.8526635Z /lib64/ld-linux-x86-64.so.2 (0x00007fe37c030000) 2025-05-07T20:03:55.8526747Z libc10.so => not found 2025-05-07T20:03:55.8527106Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007fe379388000) 2025-05-07T20:03:55.8527384Z libtorch.so => not found 2025-05-07T20:03:55.8527524Z libtorch_cpu.so => not found 2025-05-07T20:03:55.8527624Z libtorch_cuda.so => not found 2025-05-07T20:03:55.8527716Z libc10.so => not found 2025-05-07T20:03:55.8527813Z libc10_cuda.so => not found 2025-05-07T20:03:55.8527922Z libtorch.so => not found 2025-05-07T20:03:55.8528018Z libtorch_cpu.so => not found 2025-05-07T20:03:55.8528114Z libtorch_cuda.so => not found 2025-05-07T20:03:55.8528254Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.8528408Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fe378cb2000) 2025-05-07T20:03:55.8528507Z 
libtorch.so => not found 2025-05-07T20:03:55.8528602Z libc10.so => not found 2025-05-07T20:03:55.8528744Z libc10_cuda.so => not found 2025-05-07T20:03:55.8528846Z libtorch_cpu.so => not found 2025-05-07T20:03:55.8528945Z libtorch_cuda.so => not found 2025-05-07T20:03:55.8529099Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.8529197Z libtorch_cpu.so => not found 2025-05-07T20:03:55.8529293Z libtorch_cuda.so => not found 2025-05-07T20:03:55.8529393Z libtorch.so => not found 2025-05-07T20:03:55.8529543Z librt.so.1 => /lib64/librt.so.1 (0x00007fe37bfed000) 2025-05-07T20:03:55.8529728Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fe37bfe8000) 2025-05-07T20:03:55.8529762Z 2025-05-07T20:03:55.8529879Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.8530108Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:55.8530115Z 2025-05-07T20:03:55.8530119Z 2025-05-07T20:03:55.8530286Z Dynamic section at offset 0x27457c0 contains 42 entries: 2025-05-07T20:03:55.8530409Z Tag Type Name/Value 2025-05-07T20:03:55.8530625Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.8530830Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:55.8531028Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:55.8531267Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:03:55.8531463Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.8531724Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:55.8531949Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:55.8532174Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.8532378Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.8532586Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:55.8532816Z 
0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.8533009Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:55.8533207Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.8533427Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.8533650Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:55.8533866Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:03:55.8534072Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.8534189Z 0x000000000000000c (INIT) 0x1b0000 2025-05-07T20:03:55.8534307Z 0x000000000000000d (FINI) 0x73d51c 2025-05-07T20:03:55.8534434Z 0x0000000000000019 (INIT_ARRAY) 0x27387d0 2025-05-07T20:03:55.8534591Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:03:55.8534718Z 0x000000000000001a (FINI_ARRAY) 0x2738c58 2025-05-07T20:03:55.8534844Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.8534987Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:55.8535100Z 0x0000000000000005 (STRTAB) 0x2fcd8 2025-05-07T20:03:55.8535210Z 0x0000000000000006 (SYMTAB) 0x91d0 2025-05-07T20:03:55.8535350Z 0x000000000000000a (STRSZ) 1264098 (bytes) 2025-05-07T20:03:55.8535487Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.8535607Z 0x0000000000000003 (PLTGOT) 0x2746aa0 2025-05-07T20:03:55.8535740Z 0x0000000000000002 (PLTRELSZ) 68832 (bytes) 2025-05-07T20:03:55.8535915Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.8536026Z 0x0000000000000017 (JMPREL) 0x19e3c8 2025-05-07T20:03:55.8536140Z 0x0000000000000007 (RELA) 0x167bd0 2025-05-07T20:03:55.8536291Z 0x0000000000000008 (RELASZ) 223224 (bytes) 2025-05-07T20:03:55.8536438Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.8536534Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.8536686Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.8536821Z 0x000000006ffffffe 
(VERNEED) 0x167a50 2025-05-07T20:03:55.8536929Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:55.8537049Z 0x000000006ffffff0 (VERSYM) 0x1646ba 2025-05-07T20:03:55.8537174Z 0x000000006ffffff9 (RELACOUNT) 2456 2025-05-07T20:03:55.8537300Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.8537305Z 2025-05-07T20:03:55.8537418Z ################################################################################ 2025-05-07T20:03:55.8537425Z 2025-05-07T20:03:55.8537429Z 2025-05-07T20:03:55.8537547Z ################################################################################ 2025-05-07T20:03:55.8537865Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:55.8537975Z [CHECK] Listing out library size: 2025-05-07T20:03:55.8538294Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:55.8538299Z 2025-05-07T20:03:55.8538555Z 492 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:55.8538559Z 2025-05-07T20:03:55.8538993Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:55.8539542Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.8539549Z 2025-05-07T20:03:56.0474518Z GLIBC_2.2.5 2025-05-07T20:03:56.0474790Z GLIBC_2.3 2025-05-07T20:03:56.0475071Z GLIBC_2.14 2025-05-07T20:03:56.0475088Z 2025-05-07T20:03:56.0475101Z 2025-05-07T20:03:56.0476486Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:56.0478320Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 
2025-05-07T20:03:56.0478338Z 2025-05-07T20:03:56.2419611Z GLIBCXX_3.4 2025-05-07T20:03:56.2420018Z GLIBCXX_3.4.9 2025-05-07T20:03:56.2420576Z GLIBCXX_3.4.11 2025-05-07T20:03:56.2420852Z GLIBCXX_3.4.14 2025-05-07T20:03:56.2421076Z GLIBCXX_3.4.15 2025-05-07T20:03:56.2421288Z GLIBCXX_3.4.18 2025-05-07T20:03:56.2421510Z GLIBCXX_3.4.20 2025-05-07T20:03:56.2421715Z GLIBCXX_3.4.21 2025-05-07T20:03:56.2421852Z 2025-05-07T20:03:56.2421861Z 2025-05-07T20:03:56.2438385Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.zORFDm4YwY.symbols.txt 2025-05-07T20:03:56.2439978Z 2025-05-07T20:03:56.4383587Z 2025-05-07T20:03:56.4464225Z [CHECK] Total Number of symbols: 12554 2025-05-07T20:03:56.4555498Z [CHECK] Number of fbgemm symbols: 2318 2025-05-07T20:03:56.4573658Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.rdEJ53qKv7.usymbols.txt 2025-05-07T20:03:56.4574243Z 2025-05-07T20:03:56.4634020Z 2025-05-07T20:03:56.4658768Z [CHECK] Listing out undefined symbols (280 total): 2025-05-07T20:03:56.4674646Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.4675570Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.4676150Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:56.4676733Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:56.4677176Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:56.4677566Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:56.4678028Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:56.4678420Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:56.4678818Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:56.4679194Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:56.4679557Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:56.4679893Z U __cxa_atexit@GLIBC_2.2.5 
2025-05-07T20:03:56.4680287Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:56.4680622Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:56.4680948Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:56.4681273Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:56.4681606Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:56.4681924Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:56.4682255Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:56.4682581Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:56.4682905Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:56.4683214Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:56.4683530Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:56.4683975Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:56.4684342Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:56.4684742Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:56.4685124Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:56.4685514Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:56.4686224Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:56.4686667Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:56.4687116Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:56.4687751Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:56.4688360Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:56.4689243Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.4690610Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.4691669Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:56.4692865Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor 
const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.4694030Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:56.4694583Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:56.4695007Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:56.4695754Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.4696988Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.4697882Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:56.4698322Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:56.4698691Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:56.4699084Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:56.4699435Z U at::get_thread_num() 2025-05-07T20:03:56.4699737Z U at::globalContext() 2025-05-07T20:03:56.4700075Z U at::internal::set_thread_num(int) 2025-05-07T20:03:56.4700541Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:56.4700921Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:03:56.4701322Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:03:56.4701643Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:56.4701912Z U c10::AnyType::get() 2025-05-07T20:03:56.4702293Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.4702684Z U c10::BoolType::get() 2025-05-07T20:03:56.4703033Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:56.4703454Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:56.4703837Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:56.4704547Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, 
c10::DispatchKeySet) 2025-05-07T20:03:56.4705919Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:56.4707037Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:56.4707628Z U c10::Error::what() const 2025-05-07T20:03:56.4707921Z U c10::FloatType::get() 2025-05-07T20:03:56.4708246Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:56.4708567Z U c10::GradMode::is_enabled() 2025-05-07T20:03:56.4708887Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:56.4709247Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.4709687Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.4710132Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:56.4710521Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:56.4710852Z U c10::IValue::isBoolList() const 2025-05-07T20:03:56.4711168Z U c10::IValue::isIntList() const 2025-05-07T20:03:56.4711495Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:56.4711827Z U c10::IValue::isTensorList() const 2025-05-07T20:03:56.4712182Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:56.4712627Z U c10::IntType::get() 2025-05-07T20:03:56.4713169Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:56.4713592Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:56.4713956Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:56.4714322Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:56.4714775Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:56.4715339Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:56.4715712Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:56.4716219Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:56.4716816Z U c10::StorageImpl::throw_data_ptr_access_error() 
const 2025-05-07T20:03:56.4717225Z U c10::StringType::get() 2025-05-07T20:03:56.4717575Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:56.4717987Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:56.4718676Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:56.4719446Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:56.4719796Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:56.4720104Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:56.4720407Z U c10::SymIntType::get() 2025-05-07T20:03:56.4720732Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:56.4721104Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:56.4721465Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:56.4721811Z U c10::TensorType::get() 2025-05-07T20:03:56.4722123Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:56.4723025Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:56.4723947Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:56.4724294Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:56.4724618Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:56.4725132Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:56.4725465Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:56.4725814Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:56.4726277Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:56.4726751Z U c10::cuda::device_count() 2025-05-07T20:03:56.4727102Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:56.4727784Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:56.4728179Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:56.4728583Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:56.4728990Z U 
c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:56.4729395Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:56.4730062Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:56.4731152Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:56.4732062Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:56.4732939Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.4733907Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:56.4735117Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.4736037Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:56.4736371Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:56.4736901Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:56.4737480Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:56.4737905Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:56.4738329Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:56.4738742Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:56.4739085Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:56.4739440Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:56.4740053Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:56.4740647Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:56.4741190Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:56.4741606Z U c10::operator<<(std::ostream&, c10::DeviceType) 
2025-05-07T20:03:56.4742020Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:56.4742451Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:56.4742847Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:03:56.4743202Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:56.4743563Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:56.4743909Z U c10::throwNullDataPtrError() 2025-05-07T20:03:56.4744258Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:56.4744579Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:56.4744996Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:56.4745430Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:56.4745787Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:56.4746164Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.4746531Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:56.4746912Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:56.4747263Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:56.4747623Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:56.4747977Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:56.4748334Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.4748710Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:56.4749080Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:56.4749464Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:56.4749813Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:56.4750174Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:56.4750524Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:56.4750893Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.4751265Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:56.4752270Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, 
bool, bool, bool) 2025-05-07T20:03:56.4753843Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:03:56.4754448Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:03:56.4754881Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:56.4755375Z U float at::Tensor::item() const 2025-05-07T20:03:56.4755774Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.4756204Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.4756577Z U free@GLIBC_2.2.5 2025-05-07T20:03:56.4756913Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.4757314Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.4757775Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:56.4758218Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.4758615Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.4758990Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:56.4759273Z U memcpy@GLIBC_2.14 2025-05-07T20:03:56.4759574Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:56.4759879Z U memset@GLIBC_2.2.5 2025-05-07T20:03:56.4760188Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:56.4760555Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:56.4761124Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.4761910Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.4762672Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.4763457Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.4764257Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.4765034Z U 
radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.4765694Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:56.4766315Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:03:56.4767249Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:03:56.4767850Z U sqrt@GLIBC_2.2.5 2025-05-07T20:03:56.4768119Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:03:56.4768510Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:56.4769153Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:56.4769955Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:56.4770836Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.4771835Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.4772813Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:56.4773935Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.4774920Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:56.4776048Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.4777271Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:56.4778284Z U std::__cxx11::basic_stringstream, std::allocator 
>::basic_stringstream() 2025-05-07T20:03:56.4779092Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:56.4779698Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:56.4780039Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:56.4780390Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:56.4780786Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:56.4781177Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:56.4781604Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:56.4782024Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:56.4782409Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:56.4782902Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:56.4783833Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.4784655Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:56.4785017Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:56.4785364Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:56.4785896Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:56.4786410Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:56.4786903Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.4787459Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.4787945Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:56.4788359Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:56.4788770Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:56.4789458Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:56.4790159Z U 
std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:56.4790518Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:56.4790838Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:56.4791119Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:56.4791444Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:56.4792288Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:56.4793648Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:56.4794499Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:56.4795057Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:56.4795634Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:56.4796243Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:56.4796740Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:56.4797297Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:56.4797957Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:56.4798595Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:56.4799060Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:56.4799538Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:56.4799967Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:56.4800311Z U torch::autograd::Node::metadata() 2025-05-07T20:03:56.4800688Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:56.4801203Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:56.4801837Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:56.4802384Z U 
torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:56.4802849Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:56.4803415Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:56.4806712Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:56.4809747Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:56.4810174Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:56.4810599Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:56.4811044Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:56.4811741Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:56.4812639Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:56.4813707Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:56.4814530Z U typeinfo for c10::Error 2025-05-07T20:03:56.4814874Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:56.4815267Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:56.4815625Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:56.4816034Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:56.4816431Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:56.4817744Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 
2025-05-07T20:03:56.4820078Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:03:56.4821454Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:56.4821885Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:56.4822444Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:56.4822865Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:56.4823224Z U vtable for c10::Error 2025-05-07T20:03:56.4823753Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.4824323Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:56.4824786Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:56.4825238Z U vtable for torch::autograd::Node 2025-05-07T20:03:56.4825626Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:56.4826029Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:56.4826348Z w _ITM_registerTMCloneTable 2025-05-07T20:03:56.4826659Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:56.4826950Z w __gmon_start__ 2025-05-07T20:03:56.4827225Z w __pthread_key_create 2025-05-07T20:03:56.4827517Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:56.4827844Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:56.4828205Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:56.4828691Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:56.4829042Z 2025-05-07T20:03:56.4829156Z linux-vdso.so.1 (0x00007ffd38f9d000) 2025-05-07T20:03:56.4829434Z libc10.so => not found 2025-05-07T20:03:56.4829676Z libc10_cuda.so => not found 2025-05-07T20:03:56.4830311Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f009b60a000) 
2025-05-07T20:03:56.4831450Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f009b512000) 2025-05-07T20:03:56.4832203Z libtorch.so => not found 2025-05-07T20:03:56.4832966Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f009ae00000) 2025-05-07T20:03:56.4833929Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f009a400000) 2025-05-07T20:03:56.4834605Z libtorch_cpu.so => not found 2025-05-07T20:03:56.4834884Z libtorch_cuda.so => not found 2025-05-07T20:03:56.4835165Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.4835533Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f009a19c000) 2025-05-07T20:03:56.4835939Z libm.so.6 => /lib64/libm.so.6 (0x00007f009b437000) 2025-05-07T20:03:56.4836321Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f00bb98d000) 2025-05-07T20:03:56.4836710Z libc.so.6 => /lib64/libc.so.6 (0x00007f0099f94000) 2025-05-07T20:03:56.4837100Z /lib64/ld-linux-x86-64.so.2 (0x00007f00bb9c3000) 2025-05-07T20:03:56.4837433Z libc10.so => not found 2025-05-07T20:03:56.4837704Z libc10_cuda.so => not found 2025-05-07T20:03:56.4838341Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f00bb982000) 2025-05-07T20:03:56.4839011Z libtorch.so => not found 2025-05-07T20:03:56.4839268Z libtorch_cpu.so => not found 2025-05-07T20:03:56.4839547Z libtorch_cuda.so => not found 2025-05-07T20:03:56.4839842Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.4840177Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f00bb92a000) 2025-05-07T20:03:56.4840524Z libc10.so => not found 2025-05-07T20:03:56.4840779Z libc10_cuda.so => not found 2025-05-07T20:03:56.4841034Z libtorch.so => not found 2025-05-07T20:03:56.4841304Z libtorch_cpu.so => not found 2025-05-07T20:03:56.4841564Z 
libtorch_cuda.so => not found 2025-05-07T20:03:56.4841844Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.4842109Z libc10.so => not found 2025-05-07T20:03:56.4842616Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f009b3bd000) 2025-05-07T20:03:56.4843188Z libtorch.so => not found 2025-05-07T20:03:56.4843434Z libtorch_cpu.so => not found 2025-05-07T20:03:56.4843703Z libtorch_cuda.so => not found 2025-05-07T20:03:56.4843957Z libtorch.so => not found 2025-05-07T20:03:56.4844207Z libc10.so => not found 2025-05-07T20:03:56.4844442Z libc10_cuda.so => not found 2025-05-07T20:03:56.4844708Z libtorch_cpu.so => not found 2025-05-07T20:03:56.4844977Z libtorch_cuda.so => not found 2025-05-07T20:03:56.4845353Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.4845615Z libtorch.so => not found 2025-05-07T20:03:56.4845848Z libc10.so => not found 2025-05-07T20:03:56.4846082Z libtorch_cpu.so => not found 2025-05-07T20:03:56.4846336Z libtorch_cuda.so => not found 2025-05-07T20:03:56.4846686Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f009b3b4000) 2025-05-07T20:03:56.4847053Z libtorch_cpu.so => not found 2025-05-07T20:03:56.4847322Z libtorch_cuda.so => not found 2025-05-07T20:03:56.4847568Z libtorch.so => not found 2025-05-07T20:03:56.4847863Z librt.so.1 => /lib64/librt.so.1 (0x00007f009b3ad000) 2025-05-07T20:03:56.4848134Z 2025-05-07T20:03:56.4848246Z [CHECK] Displaying ELF information: 2025-05-07T20:03:56.4848697Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:56.4849082Z 2025-05-07T20:03:56.4849088Z 2025-05-07T20:03:56.4849249Z Dynamic section at offset 0x1eb9cd68 contains 42 entries: 2025-05-07T20:03:56.4849634Z Tag Type Name/Value 2025-05-07T20:03:56.4850048Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:56.4850550Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:56.4851077Z 0x0000000000000001 
(NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:56.4851663Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:56.4852212Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:56.4852709Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:56.4853222Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:56.4853748Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:56.4854264Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:56.4854776Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:56.4855339Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:56.4855836Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:56.4856330Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:56.4856852Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:56.4857406Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:56.4857993Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:56.4858537Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:56.4858966Z 0x000000000000000c (INIT) 0x5b0000 2025-05-07T20:03:56.4859299Z 0x000000000000000d (FINI) 0x2ee447c 2025-05-07T20:03:56.4859649Z 0x0000000000000019 (INIT_ARRAY) 0x1eb90820 2025-05-07T20:03:56.4860019Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:03:56.4860364Z 0x000000000000001a (FINI_ARRAY) 0x1eb90f40 2025-05-07T20:03:56.4860707Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:56.4861042Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:56.4861365Z 0x0000000000000005 (STRTAB) 0x5ab08 2025-05-07T20:03:56.4861683Z 0x0000000000000006 (SYMTAB) 0x11200 2025-05-07T20:03:56.4862039Z 
0x000000000000000a (STRSZ) 5105620 (bytes) 2025-05-07T20:03:56.4862400Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:56.4862741Z 0x0000000000000003 (PLTGOT) 0x1eb9e048 2025-05-07T20:03:56.4863104Z 0x0000000000000002 (PLTRELSZ) 63264 (bytes) 2025-05-07T20:03:56.4863442Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:56.4863765Z 0x0000000000000017 (JMPREL) 0x59f9b0 2025-05-07T20:03:56.4864086Z 0x0000000000000007 (RELA) 0x53f668 2025-05-07T20:03:56.4864443Z 0x0000000000000008 (RELASZ) 394056 (bytes) 2025-05-07T20:03:56.4864789Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:56.4865115Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:56.4865443Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:56.4865789Z 0x000000006ffffffe (VERNEED) 0x53f4f8 2025-05-07T20:03:56.4866124Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:56.4866436Z 0x000000006ffffff0 (VERSYM) 0x5392dc 2025-05-07T20:03:56.4866772Z 0x000000006ffffff9 (RELACOUNT) 2708 2025-05-07T20:03:56.4867072Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:56.4867282Z 2025-05-07T20:03:56.4867387Z ################################################################################ 2025-05-07T20:03:56.4867608Z 2025-05-07T20:03:56.4867611Z 2025-05-07T20:03:56.4867729Z ################################################################################ 2025-05-07T20:03:56.4868259Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:56.4868795Z [CHECK] Listing out library size: 2025-05-07T20:03:56.4869282Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:56.4869712Z 2025-05-07T20:03:56.4869950Z 76 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:56.4870299Z 2025-05-07T20:03:56.4870739Z [CHECK] Listing out the GLIBC versions referenced by: 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:56.4871807Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:56.4872548Z 2025-05-07T20:03:56.5079454Z GLIBC_2.2.5 2025-05-07T20:03:56.5080902Z GLIBC_2.3 2025-05-07T20:03:56.5081596Z GLIBC_2.14 2025-05-07T20:03:56.5081972Z 2025-05-07T20:03:56.5081977Z 2025-05-07T20:03:56.5082462Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:56.5083831Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:56.5084540Z 2025-05-07T20:03:56.5352051Z GLIBCXX_3.4 2025-05-07T20:03:56.5352733Z GLIBCXX_3.4.9 2025-05-07T20:03:56.5352987Z GLIBCXX_3.4.11 2025-05-07T20:03:56.5353208Z GLIBCXX_3.4.18 2025-05-07T20:03:56.5353414Z GLIBCXX_3.4.20 2025-05-07T20:03:56.5353639Z GLIBCXX_3.4.21 2025-05-07T20:03:56.5353765Z 2025-05-07T20:03:56.5353769Z 2025-05-07T20:03:56.5373965Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.r3b7DhQtas.symbols.txt 2025-05-07T20:03:56.5374636Z 2025-05-07T20:03:56.5613526Z 2025-05-07T20:03:56.5638581Z [CHECK] Total Number of symbols: 1609 2025-05-07T20:03:56.5660453Z [CHECK] Number of fbgemm symbols: 227 2025-05-07T20:03:56.5675440Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.hjeumdwjNw.usymbols.txt 2025-05-07T20:03:56.5676020Z 2025-05-07T20:03:56.5701051Z 2025-05-07T20:03:56.5736973Z [CHECK] Listing out undefined symbols (176 total): 2025-05-07T20:03:56.5758900Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.5762987Z U VTT for 
std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.5763759Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:56.5764138Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:56.5764554Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:56.5764958Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:56.5765362Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:56.5765750Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:56.5766118Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:56.5766491Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:56.5766854Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:56.5767165Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:56.5767495Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:56.5767796Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:56.5768118Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:56.5768451Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:56.5768768Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:56.5769107Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:56.5769508Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:56.5769946Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:56.5770383Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:56.5770863Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:56.5771757Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.5773121Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.5774317Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:56.5775257Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 
2025-05-07T20:03:56.5776157Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.5777412Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.5778262Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:56.5778675Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:56.5779008Z U at::globalContext() 2025-05-07T20:03:56.5779458Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.5779872Z U c10::BoolType::get() 2025-05-07T20:03:56.5780241Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:56.5780613Z U c10::FloatType::get() 2025-05-07T20:03:56.5780930Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:56.5781340Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.5781768Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:56.5782117Z U c10::IntType::get() 2025-05-07T20:03:56.5782477Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:56.5782881Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:56.5783268Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:56.5783678Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:56.5784088Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:56.5784749Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:56.5785408Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:56.5786168Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:56.5786585Z U c10::SymIntType::get() 2025-05-07T20:03:56.5786968Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:56.5787396Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:56.5787783Z U c10::TensorType::get() 2025-05-07T20:03:56.5788115Z U 
c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:56.5789100Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:56.5790103Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:56.5790473Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:56.5790831Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:56.5791182Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:56.5791536Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:56.5791897Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:56.5792444Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:56.5792940Z U c10::cuda::device_count() 2025-05-07T20:03:56.5793288Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:56.5793684Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:56.5794087Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:56.5794559Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:56.5794983Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:56.5795365Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:56.5796216Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:56.5797132Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:56.5798013Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.5799039Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:56.5800117Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.5800950Z U c10::impl::GPUTrace::gpuTraceState 
2025-05-07T20:03:56.5801300Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:56.5801666Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:56.5814488Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:56.5814920Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:56.5815312Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:56.5815717Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:56.5816084Z U c10::throwNullDataPtrError() 2025-05-07T20:03:56.5816422Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:56.5816746Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:56.5817172Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:56.5817596Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:56.5817964Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:56.5818349Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.5818720Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:56.5818840Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:56.5818985Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:56.5819102Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:56.5819223Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:56.5819366Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.5819494Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:56.5819635Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:56.5819768Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:56.5819886Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:56.5820006Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:56.5820128Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:56.5820281Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.5820406Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:56.5822658Z U 
embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:56.5822977Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:56.5823309Z U float at::Tensor::item() const 2025-05-07T20:03:56.5823474Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.5823669Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.5823801Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.5823972Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.5824188Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:56.5824327Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.5824490Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.5824594Z U memcpy@GLIBC_2.14 2025-05-07T20:03:56.5824693Z U memset@GLIBC_2.2.5 2025-05-07T20:03:56.5824829Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:56.5824958Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:56.5825298Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.5825736Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.5826090Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.5826409Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.5826755Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:56.5827172Z U std::__cxx11::basic_ostringstream, std::allocator 
>::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:56.5827578Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.5828330Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.5828727Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:56.5829150Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.5829637Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:56.5830168Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.5830521Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:56.5830907Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:56.5831029Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:56.5831159Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:56.5831303Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:56.5831486Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:56.5831684Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:56.5831828Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:56.5832240Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:56.5833415Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.5833578Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:56.5833701Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:56.5833866Z U 
std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:56.5833986Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:56.5834176Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.5834434Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.5834571Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:56.5834750Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:56.5834918Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:56.5835148Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:56.5836111Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:56.5836603Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:56.5836868Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:56.5837243Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:56.5837832Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:56.5839320Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.5840826Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.5842435Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.5843879Z U void 
embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.5845336Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.5847034Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.5849118Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.5851019Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.5852868Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.5854724Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.5856543Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.5858379Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.5860134Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:03:56.5860308Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:56.5860461Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:56.5860625Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:56.5860943Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.5861157Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:56.5861280Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:56.5861389Z w _ITM_registerTMCloneTable 2025-05-07T20:03:56.5861491Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:56.5861591Z w __gmon_start__ 2025-05-07T20:03:56.5861688Z w __pthread_key_create 2025-05-07T20:03:56.5861792Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:56.5861904Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:56.5862089Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:56.5862334Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:56.5862343Z 2025-05-07T20:03:56.5862463Z linux-vdso.so.1 (0x00007ffec5bb4000) 2025-05-07T20:03:56.5862547Z libc10.so => not found 2025-05-07T20:03:56.5862639Z libc10_cuda.so => not found 2025-05-07T20:03:56.5863193Z fbgemm_gpu_tbe_training_backward.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f27de200000) 2025-05-07T20:03:56.5863291Z libtorch.so => not found 2025-05-07T20:03:56.5863382Z libtorch_cpu.so => not found 2025-05-07T20:03:56.5863470Z libtorch_cuda.so => not found 2025-05-07T20:03:56.5863581Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.5863737Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f27ddf9c000) 2025-05-07T20:03:56.5863880Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2803304000) 2025-05-07T20:03:56.5864019Z libc.so.6 => /lib64/libc.so.6 (0x00007f27ddd94000) 2025-05-07T20:03:56.5864135Z /lib64/ld-linux-x86-64.so.2 (0x00007f2803338000) 2025-05-07T20:03:56.5864219Z libc10.so => not found 2025-05-07T20:03:56.5864309Z libc10_cuda.so => not found 2025-05-07T20:03:56.5864774Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f27ddb9e000) 2025-05-07T20:03:56.5865297Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f280320a000) 2025-05-07T20:03:56.5865381Z libtorch.so => not found 2025-05-07T20:03:56.5865729Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f27dd600000) 2025-05-07T20:03:56.5866161Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f27dcc00000) 2025-05-07T20:03:56.5866257Z libtorch_cpu.so => not found 2025-05-07T20:03:56.5866356Z libtorch_cuda.so => not found 2025-05-07T20:03:56.5866449Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.5866593Z libm.so.6 => /lib64/libm.so.6 (0x00007f27dd525000) 2025-05-07T20:03:56.5866680Z libc10.so => not found 2025-05-07T20:03:56.5866776Z libc10_cuda.so => not found 2025-05-07T20:03:56.5867191Z fbgemm_gpu_config.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f27fe3f5000) 2025-05-07T20:03:56.5867308Z libtorch.so => not found 2025-05-07T20:03:56.5867429Z libtorch_cpu.so => not found 2025-05-07T20:03:56.5867521Z libtorch_cuda.so => not found 2025-05-07T20:03:56.5867608Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.5867764Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f27fe39f000) 2025-05-07T20:03:56.5867848Z libc10.so => not found 2025-05-07T20:03:56.5867936Z libc10_cuda.so => not found 2025-05-07T20:03:56.5868026Z libtorch.so => not found 2025-05-07T20:03:56.5868152Z libtorch_cpu.so => not found 2025-05-07T20:03:56.5868240Z libtorch_cuda.so => not found 2025-05-07T20:03:56.5868332Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.5868421Z libc10.so => not found 2025-05-07T20:03:56.5868763Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f27dd4ad000) 2025-05-07T20:03:56.5868850Z libtorch.so => not found 2025-05-07T20:03:56.5868939Z libtorch_cpu.so => not found 2025-05-07T20:03:56.5869039Z libtorch_cuda.so => not found 2025-05-07T20:03:56.5869125Z libtorch.so => not found 2025-05-07T20:03:56.5869207Z libc10.so => not found 2025-05-07T20:03:56.5869305Z libc10_cuda.so => not found 2025-05-07T20:03:56.5869390Z libtorch_cpu.so => not found 2025-05-07T20:03:56.5869478Z libtorch_cuda.so => not found 2025-05-07T20:03:56.5869570Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.5869666Z libtorch.so => not found 2025-05-07T20:03:56.5869746Z libc10.so => not found 2025-05-07T20:03:56.5869837Z libtorch_cpu.so => not found 2025-05-07T20:03:56.5869933Z libtorch_cuda.so => not found 2025-05-07T20:03:56.5870097Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f27fe394000) 2025-05-07T20:03:56.5870187Z libtorch_cpu.so => not found 2025-05-07T20:03:56.5870277Z libtorch_cuda.so => not found 2025-05-07T20:03:56.5870368Z libtorch.so => not found 2025-05-07T20:03:56.5870494Z 
librt.so.1 => /lib64/librt.so.1 (0x00007f27fe38d000) 2025-05-07T20:03:56.5870501Z 2025-05-07T20:03:56.5870604Z [CHECK] Displaying ELF information: 2025-05-07T20:03:56.5870884Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:56.5870889Z 2025-05-07T20:03:56.5878478Z 2025-05-07T20:03:56.5878772Z Dynamic section at offset 0x4b7dd08 contains 38 entries: 2025-05-07T20:03:56.5878999Z Tag Type Name/Value 2025-05-07T20:03:56.5879412Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:56.5879677Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:56.5879940Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:56.5880144Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:56.5880351Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:56.5880556Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:56.5880778Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:56.5880987Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:56.5881271Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:56.5881627Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:56.5882010Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:56.5882554Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:03:56.5882925Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:56.5883053Z 0x000000000000000c (INIT) 0xac000 2025-05-07T20:03:56.5883170Z 0x000000000000000d (FINI) 0x5df4cc 2025-05-07T20:03:56.5883289Z 0x0000000000000019 (INIT_ARRAY) 0x4b7d9f8 2025-05-07T20:03:56.5883471Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:03:56.5883593Z 0x000000000000001a 
(FINI_ARRAY) 0x4b7dac0 2025-05-07T20:03:56.5883744Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:56.5883873Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:56.5883997Z 0x0000000000000005 (STRTAB) 0xc368 2025-05-07T20:03:56.5884108Z 0x0000000000000006 (SYMTAB) 0x2c78 2025-05-07T20:03:56.5884257Z 0x000000000000000a (STRSZ) 595540 (bytes) 2025-05-07T20:03:56.5884403Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:56.5884525Z 0x0000000000000003 (PLTGOT) 0x4b7efa8 2025-05-07T20:03:56.5884661Z 0x0000000000000002 (PLTRELSZ) 12672 (bytes) 2025-05-07T20:03:56.5884782Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:56.5884895Z 0x0000000000000017 (JMPREL) 0xa7fe0 2025-05-07T20:03:56.5885004Z 0x0000000000000007 (RELA) 0x9e770 2025-05-07T20:03:56.5885153Z 0x0000000000000008 (RELASZ) 39024 (bytes) 2025-05-07T20:03:56.5885276Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:56.5885374Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:56.5885506Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:56.5885623Z 0x000000006ffffffe (VERNEED) 0x9e650 2025-05-07T20:03:56.5885967Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:56.5886119Z 0x000000006ffffff0 (VERSYM) 0x9d9bc 2025-05-07T20:03:56.5886235Z 0x000000006ffffff9 (RELACOUNT) 239 2025-05-07T20:03:56.5886334Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:56.5886341Z 2025-05-07T20:03:56.5886461Z ################################################################################ 2025-05-07T20:03:56.5886478Z 2025-05-07T20:03:56.5886481Z 2025-05-07T20:03:56.5886593Z ################################################################################ 2025-05-07T20:03:56.5886938Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:56.5887048Z [CHECK] Listing out library size: 2025-05-07T20:03:56.5887384Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 
2025-05-07T20:03:56.5887389Z 2025-05-07T20:03:56.5896466Z 31 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:56.5897462Z 2025-05-07T20:03:56.5901733Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:56.5902301Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:56.6072799Z 2025-05-07T20:03:56.6073051Z GLIBC_2.2.5 2025-05-07T20:03:56.6073230Z GLIBC_2.3 2025-05-07T20:03:56.6073348Z GLIBC_2.14 2025-05-07T20:03:56.6073387Z 2025-05-07T20:03:56.6073392Z 2025-05-07T20:03:56.6074076Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:56.6074996Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:56.6075017Z 2025-05-07T20:03:56.6239446Z GLIBCXX_3.4 2025-05-07T20:03:56.6239687Z GLIBCXX_3.4.9 2025-05-07T20:03:56.6239918Z GLIBCXX_3.4.11 2025-05-07T20:03:56.6240122Z GLIBCXX_3.4.15 2025-05-07T20:03:56.6240341Z GLIBCXX_3.4.18 2025-05-07T20:03:56.6240542Z GLIBCXX_3.4.20 2025-05-07T20:03:56.6240897Z GLIBCXX_3.4.21 2025-05-07T20:03:56.6241276Z 2025-05-07T20:03:56.6241543Z 2025-05-07T20:03:56.6265372Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.1iXrmbsSnT.symbols.txt 2025-05-07T20:03:56.6265946Z 2025-05-07T20:03:56.6393931Z 2025-05-07T20:03:56.6424654Z [CHECK] Total Number of symbols: 1857 2025-05-07T20:03:56.6441024Z [CHECK] Number of fbgemm symbols: 100 2025-05-07T20:03:56.6460979Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > 
/tmp/tmp.onYVMeatj4.usymbols.txt 2025-05-07T20:03:56.6462668Z 2025-05-07T20:03:56.6485512Z 2025-05-07T20:03:56.6515142Z [CHECK] Listing out undefined symbols (267 total): 2025-05-07T20:03:56.6531499Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.6532619Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.6533203Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:56.6533570Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:56.6533974Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:56.6534378Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:56.6534759Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:56.6535155Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:56.6535507Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:56.6535880Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:56.6536255Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:56.6536571Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:56.6536893Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:56.6537207Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:56.6537535Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:56.6537856Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:56.6538173Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:56.6538485Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:56.6538805Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:56.6539129Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:56.6539446Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:56.6539761Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:56.6540070Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:56.6540392Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:56.6540761Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:56.6541201Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:56.6541629Z U 
at::RecordFunction::currentThreadId() 2025-05-07T20:03:56.6541962Z U at::RecordFunction::end() 2025-05-07T20:03:56.6542302Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:56.6542671Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:56.6543121Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:56.6543589Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:56.6544481Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.6545911Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.6546874Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:56.6547603Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.6548796Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.6549588Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:56.6549924Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:56.6550304Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:56.6550676Z U at::globalContext() 2025-05-07T20:03:56.6550991Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:56.6551295Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:56.6551566Z U c10::AnyType::get() 2025-05-07T20:03:56.6551948Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.6552333Z U c10::BoolType::get() 2025-05-07T20:03:56.6552793Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:56.6553441Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:56.6553920Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:56.6554677Z U 
c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:56.6555973Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:56.6557127Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:56.6557723Z U c10::Error::what() const 2025-05-07T20:03:56.6558041Z U c10::FloatType::get() 2025-05-07T20:03:56.6558359Z U c10::GradMode::is_enabled() 2025-05-07T20:03:56.6558676Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:56.6559083Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.6559531Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:56.6559932Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:56.6560278Z U c10::IValue::isBoolList() const 2025-05-07T20:03:56.6560599Z U c10::IValue::isIntList() const 2025-05-07T20:03:56.6560939Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:56.6561266Z U c10::IValue::isTensorList() const 2025-05-07T20:03:56.6561637Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:56.6561989Z U c10::IntType::get() 2025-05-07T20:03:56.6562359Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:56.6562775Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:56.6563127Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:56.6563490Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:56.6563940Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:56.6564564Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:56.6565132Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:56.6565599Z U c10::StringType::get() 2025-05-07T20:03:56.6565971Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:56.6566344Z U 
c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:56.6566748Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:56.6567184Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:56.6567597Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:56.6568237Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:56.6568856Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:56.6569212Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:56.6569593Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:56.6569947Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:56.6570289Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:56.6570639Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:56.6570989Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:56.6571316Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:56.6571626Z U c10::SymIntType::get() 2025-05-07T20:03:56.6571959Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:56.6572322Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:56.6572687Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:56.6573032Z U c10::TensorType::get() 2025-05-07T20:03:56.6573337Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:56.6574230Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:56.6575150Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:56.6575496Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:56.6575818Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:56.6576145Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:56.6576461Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:56.6576787Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:56.6577240Z U 
c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:56.6577679Z U c10::cuda::device_count() 2025-05-07T20:03:56.6578009Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:56.6578360Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:56.6578728Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:56.6579098Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:56.6579470Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:56.6579838Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:56.6580450Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:56.6581634Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:56.6582712Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:56.6583597Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.6584600Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:56.6585844Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.6586772Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:56.6587122Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:56.6587664Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:56.6588310Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:56.6588818Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:56.6589246Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:56.6589665Z U c10::impl::device_guard_impl_registry 
2025-05-07T20:03:56.6590006Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:56.6590390Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:56.6591052Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:56.6591667Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:56.6592051Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:56.6592522Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:56.6592940Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:56.6593388Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:56.6593794Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:56.6594172Z U c10::throwNullDataPtrError() 2025-05-07T20:03:56.6594500Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:56.6594842Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:56.6595264Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:56.6595713Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:56.6596088Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:56.6596465Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.6596848Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:56.6597210Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:56.6597573Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:56.6597920Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:56.6598268Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:56.6598630Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.6598990Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:56.6599374Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:56.6599745Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:56.6600104Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:56.6600479Z U cudaLaunchKernel@libcudart.so.11.0 
2025-05-07T20:03:56.6600838Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:56.6601222Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.6601606Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:56.6604153Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:56.6606912Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:56.6607433Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.6607825Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.6608199Z U free@GLIBC_2.2.5 2025-05-07T20:03:56.6608555Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.6608918Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.6609344Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:56.6609742Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.6610137Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.6610484Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:56.6610781Z U memcpy@GLIBC_2.14 2025-05-07T20:03:56.6611082Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:56.6611364Z U memset@GLIBC_2.2.5 2025-05-07T20:03:56.6611676Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:56.6612014Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:56.6612570Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.6613305Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.6613846Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:56.6614258Z U std::_Hash_bytes(void const*, unsigned long, unsigned 
long)@CXXABI_1.3.5 2025-05-07T20:03:56.6614895Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:56.6615727Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:56.6616619Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.6617637Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.6618657Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:56.6619554Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.6620718Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:56.6621838Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.6623051Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.6624121Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:56.6625214Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:56.6626084Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:56.6627039Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:56.6627698Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:56.6628081Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:56.6628465Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:56.6628916Z U std::__throw_logic_error(char 
const*)@GLIBCXX_3.4 2025-05-07T20:03:56.6629345Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:56.6629786Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:56.6630176Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:56.6630694Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:56.6631663Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.6632577Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:56.6632963Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:56.6633422Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:56.6633781Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:56.6634143Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:56.6634560Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.6635129Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.6635623Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:56.6636043Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:56.6636471Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:56.6637149Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:56.6637846Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:56.6638216Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:56.6638524Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:56.6638816Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:56.6639124Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:56.6639965Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:56.6641168Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, 
torch::_RegisterOrVerify) & 2025-05-07T20:03:56.6642017Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:56.6642533Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:56.6643070Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:56.6643690Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:56.6644204Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:56.6644756Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:56.6645428Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:56.6646077Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:56.6646575Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:56.6647235Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:56.6647655Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:56.6648011Z U torch::autograd::Node::metadata() 2025-05-07T20:03:56.6648397Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:56.6648912Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:56.6649728Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:56.6650266Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:56.6650719Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:56.6651272Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:56.6654321Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, 
std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:56.6657292Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:56.6657707Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:56.6658153Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:56.6659231Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:56.6660279Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:56.6660959Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:56.6661860Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:56.6662892Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:56.6663667Z U typeinfo for c10::Error 2025-05-07T20:03:56.6664031Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:56.6664408Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:56.6664785Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:56.6665151Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:56.6665547Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:56.6667248Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.6670319Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.6673356Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.6676311Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.6679259Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.6682180Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:56.6683853Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:56.6684280Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:56.6684720Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:56.6685152Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:56.6685530Z U vtable for c10::Error 2025-05-07T20:03:56.6686229Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.6686809Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:56.6687283Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:56.6687746Z U vtable for torch::autograd::Node 2025-05-07T20:03:56.6688140Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:56.6688546Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:56.6688861Z w _ITM_registerTMCloneTable 2025-05-07T20:03:56.6689174Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:56.6689465Z w __gmon_start__ 2025-05-07T20:03:56.6689740Z w __pthread_key_create 2025-05-07T20:03:56.6690047Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:56.6690436Z w pthread_mutex_unlock@GLIBC_2.2.5 
2025-05-07T20:03:56.6690807Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:56.6691320Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:56.6691743Z 2025-05-07T20:03:56.6691852Z linux-vdso.so.1 (0x00007ffccf7f6000) 2025-05-07T20:03:56.6692137Z libc10.so => not found 2025-05-07T20:03:56.6692419Z libc10_cuda.so => not found 2025-05-07T20:03:56.6693177Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f86e6e00000) 2025-05-07T20:03:56.6693960Z libtorch.so => not found 2025-05-07T20:03:56.6694213Z libtorch_cpu.so => not found 2025-05-07T20:03:56.6694473Z libtorch_cuda.so => not found 2025-05-07T20:03:56.6694783Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.6695112Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f86e6b9c000) 2025-05-07T20:03:56.6695540Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f870901d000) 2025-05-07T20:03:56.6695941Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8706fd2000) 2025-05-07T20:03:56.6696324Z libc.so.6 => /lib64/libc.so.6 (0x00007f86e6994000) 2025-05-07T20:03:56.6696688Z /lib64/ld-linux-x86-64.so.2 (0x00007f8709079000) 2025-05-07T20:03:56.6697005Z libc10.so => not found 2025-05-07T20:03:56.6697253Z libc10_cuda.so => not found 2025-05-07T20:03:56.6697903Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f86e679e000) 2025-05-07T20:03:56.6699212Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f86e66a6000) 2025-05-07T20:03:56.6699914Z libtorch.so => not found 2025-05-07T20:03:56.6700399Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f86e6000000) 2025-05-07T20:03:56.6701467Z fbgemm_gpu_tbe_utils.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f86e5600000) 2025-05-07T20:03:56.6702117Z libtorch_cpu.so => not found 2025-05-07T20:03:56.6702382Z libtorch_cuda.so => not found 2025-05-07T20:03:56.6702643Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.6702944Z libm.so.6 => /lib64/libm.so.6 (0x00007f86e65cb000) 2025-05-07T20:03:56.6703255Z libc10.so => not found 2025-05-07T20:03:56.6703492Z libc10_cuda.so => not found 2025-05-07T20:03:56.6704102Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f870900c000) 2025-05-07T20:03:56.6704737Z libtorch.so => not found 2025-05-07T20:03:56.6704988Z libtorch_cpu.so => not found 2025-05-07T20:03:56.6705251Z libtorch_cuda.so => not found 2025-05-07T20:03:56.6705522Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.6705774Z libc10.so => not found 2025-05-07T20:03:56.6706010Z libc10_cuda.so => not found 2025-05-07T20:03:56.6706256Z libtorch.so => not found 2025-05-07T20:03:56.6706506Z libtorch_cpu.so => not found 2025-05-07T20:03:56.6706768Z libtorch_cuda.so => not found 2025-05-07T20:03:56.6707024Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.6707280Z libc10.so => not found 2025-05-07T20:03:56.6707779Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f8706f56000) 2025-05-07T20:03:56.6708333Z libtorch.so => not found 2025-05-07T20:03:56.6708573Z libtorch_cpu.so => not found 2025-05-07T20:03:56.6708839Z libtorch_cuda.so => not found 2025-05-07T20:03:56.6709087Z libtorch.so => not found 2025-05-07T20:03:56.6709333Z libc10.so => not found 2025-05-07T20:03:56.6709556Z libc10_cuda.so => not found 2025-05-07T20:03:56.6709818Z libtorch_cpu.so => not found 2025-05-07T20:03:56.6710079Z libtorch_cuda.so => not found 2025-05-07T20:03:56.6710335Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.6710600Z libtorch.so => not found 
2025-05-07T20:03:56.6710859Z libc10.so => not found 2025-05-07T20:03:56.6711104Z libtorch_cpu.so => not found 2025-05-07T20:03:56.6711356Z libtorch_cuda.so => not found 2025-05-07T20:03:56.6711695Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f8706f4b000) 2025-05-07T20:03:56.6712061Z libtorch_cpu.so => not found 2025-05-07T20:03:56.6712350Z libtorch_cuda.so => not found 2025-05-07T20:03:56.6712678Z libtorch.so => not found 2025-05-07T20:03:56.6713185Z librt.so.1 => /lib64/librt.so.1 (0x00007f8706f44000) 2025-05-07T20:03:56.6713429Z 2025-05-07T20:03:56.6713575Z [CHECK] Displaying ELF information: 2025-05-07T20:03:56.6714076Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:56.6714500Z 2025-05-07T20:03:56.6714504Z 2025-05-07T20:03:56.6714701Z Dynamic section at offset 0x1e278a8 contains 39 entries: 2025-05-07T20:03:56.6715086Z Tag Type Name/Value 2025-05-07T20:03:56.6715516Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:56.6716039Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:56.6716618Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:56.6717213Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:56.6717732Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:56.6718275Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:56.6718811Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:56.6719359Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:56.6719892Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:56.6720403Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:56.6720926Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:56.6721450Z 0x0000000000000001 (NEEDED) Shared library: 
[ld-linux-x86-64.so.2] 2025-05-07T20:03:56.6722084Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:03:56.6722674Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:56.6723098Z 0x000000000000000c (INIT) 0x79000 2025-05-07T20:03:56.6723448Z 0x000000000000000d (FINI) 0x25a06c 2025-05-07T20:03:56.6723792Z 0x0000000000000019 (INIT_ARRAY) 0x1e260e0 2025-05-07T20:03:56.6724171Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:03:56.6724528Z 0x000000000000001a (FINI_ARRAY) 0x1e26198 2025-05-07T20:03:56.6724895Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:56.6725353Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:56.6725679Z 0x0000000000000005 (STRTAB) 0xe130 2025-05-07T20:03:56.6726005Z 0x0000000000000006 (SYMTAB) 0x3300 2025-05-07T20:03:56.6726335Z 0x000000000000000a (STRSZ) 373406 (bytes) 2025-05-07T20:03:56.6726691Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:56.6727018Z 0x0000000000000003 (PLTGOT) 0x1e27b58 2025-05-07T20:03:56.6727369Z 0x0000000000000002 (PLTRELSZ) 18480 (bytes) 2025-05-07T20:03:56.6727689Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:56.6727998Z 0x0000000000000017 (JMPREL) 0x73f80 2025-05-07T20:03:56.6728305Z 0x0000000000000007 (RELA) 0x6a398 2025-05-07T20:03:56.6728642Z 0x0000000000000008 (RELASZ) 39912 (bytes) 2025-05-07T20:03:56.6728987Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:56.6729290Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:56.6729601Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:56.6729925Z 0x000000006ffffffe (VERNEED) 0x6a258 2025-05-07T20:03:56.6730284Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:56.6730586Z 0x000000006ffffff0 (VERSYM) 0x693ce 2025-05-07T20:03:56.6730906Z 0x000000006ffffff9 (RELACOUNT) 270 2025-05-07T20:03:56.6731195Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:56.6731427Z 2025-05-07T20:03:56.6731534Z 
################################################################################ 2025-05-07T20:03:56.6731770Z 2025-05-07T20:03:56.6731775Z 2025-05-07T20:03:56.6731897Z ################################################################################ 2025-05-07T20:03:56.6732409Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:56.6732926Z [CHECK] Listing out library size: 2025-05-07T20:03:56.6733419Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:56.6733831Z 2025-05-07T20:03:56.6734066Z 175 ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:56.6734408Z 2025-05-07T20:03:56.6734832Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:56.6735850Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:56.6736478Z 2025-05-07T20:03:56.7272717Z GLIBC_2.2.5 2025-05-07T20:03:56.7273092Z GLIBC_2.3 2025-05-07T20:03:56.7273339Z GLIBC_2.14 2025-05-07T20:03:56.7273585Z 2025-05-07T20:03:56.7273720Z 2025-05-07T20:03:56.7274245Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:56.7275443Z + objdump -TC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:56.7904627Z 2025-05-07T20:03:56.7905044Z GLIBCXX_3.4 2025-05-07T20:03:56.7905736Z GLIBCXX_3.4.9 2025-05-07T20:03:56.7906370Z GLIBCXX_3.4.11 2025-05-07T20:03:56.7906983Z GLIBCXX_3.4.18 2025-05-07T20:03:56.7907583Z GLIBCXX_3.4.20 2025-05-07T20:03:56.7908152Z GLIBCXX_3.4.21 
2025-05-07T20:03:56.7908536Z 2025-05-07T20:03:56.7908549Z 2025-05-07T20:03:56.7932886Z + nm -gDC ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.65QAKwSVnk.symbols.txt 2025-05-07T20:03:56.7934521Z 2025-05-07T20:03:56.8525366Z 2025-05-07T20:03:56.8564300Z [CHECK] Total Number of symbols: 3695 2025-05-07T20:03:56.8596962Z [CHECK] Number of fbgemm symbols: 551 2025-05-07T20:03:56.8616980Z + nm -gDCu ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.hvnUsV7cso.usymbols.txt 2025-05-07T20:03:56.8618678Z 2025-05-07T20:03:56.8648832Z 2025-05-07T20:03:56.8675546Z [CHECK] Listing out undefined symbols (183 total): 2025-05-07T20:03:56.8698869Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.8699711Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.8700291Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:56.8700698Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:56.8701143Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:56.8701542Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:56.8701944Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:56.8702332Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:56.8702761Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:56.8703148Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:56.8704809Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:56.8705124Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:56.8705458Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:56.8705771Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:56.8706181Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:56.8706512Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:56.8706922Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:56.8707246Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:56.8707622Z U 
at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:56.8708047Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:56.8708522Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:56.8708990Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:56.8709466Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:56.8710368Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.8711765Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.8712882Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:56.8713557Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:56.8714502Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.8715702Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:56.8716593Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:56.8717035Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:56.8717392Z U at::globalContext() 2025-05-07T20:03:56.8717818Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.8718257Z U c10::BoolType::get() 2025-05-07T20:03:56.8718627Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:56.8719016Z U c10::FloatType::get() 2025-05-07T20:03:56.8719343Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:56.8719766Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.8720206Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:56.8720577Z U c10::IntType::get() 2025-05-07T20:03:56.8720946Z U 
c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:56.8721378Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:56.8721794Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:56.8722222Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:56.8722643Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:56.8723070Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:56.8723531Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:56.8724217Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:56.8724929Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:56.8725448Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:56.8725793Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:56.8726175Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:56.8726569Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:56.8726919Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:56.8727265Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:56.8727559Z U c10::SymIntType::get() 2025-05-07T20:03:56.8727905Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:56.8728326Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:56.8728687Z U c10::TensorType::get() 2025-05-07T20:03:56.8729011Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:56.8729907Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:56.8730833Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:56.8731189Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:56.8731514Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:56.8731848Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:56.8732168Z U c10::cuda::MaybeSetDevice(signed char) 
2025-05-07T20:03:56.8732501Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:56.8732940Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:56.8733394Z U c10::cuda::device_count() 2025-05-07T20:03:56.8733731Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:56.8734090Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:56.8734454Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:56.8734823Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:56.8735218Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:56.8735587Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:56.8736282Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:56.8737124Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:56.8737942Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.8738849Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:56.8739844Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.8740612Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:56.8740931Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:56.8741288Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:56.8741698Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:56.8742089Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:56.8742419Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:03:56.8742797Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:56.8743160Z U c10::operator<<(std::ostream&, c10::Device const&) 
2025-05-07T20:03:56.8743526Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:56.8743932Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:56.8744301Z U c10::throwNullDataPtrError() 2025-05-07T20:03:56.8744618Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:56.8744924Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:56.8745319Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:56.8745736Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:56.8746105Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:56.8746466Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.8746601Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:56.8746716Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:56.8746850Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:56.8746962Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:56.8747079Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:56.8747212Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.8747333Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:56.8747470Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:56.8747588Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:56.8747713Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:56.8747827Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:56.8747941Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:56.8748078Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:56.8748199Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:56.8750305Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:56.8750510Z U 
fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:56.8750624Z U float at::Tensor::item() const 2025-05-07T20:03:56.8750762Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.8750914Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.8751036Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.8751182Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.8751354Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:56.8751484Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:56.8751628Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:56.8751734Z U memcpy@GLIBC_2.14 2025-05-07T20:03:56.8751827Z U memset@GLIBC_2.2.5 2025-05-07T20:03:56.8751937Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:56.8752068Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:56.8752466Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.8752991Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.8753322Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.8753729Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.8754057Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.8754401Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:56.8754785Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:56.8755197Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:56.8755627Z U 
std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.8756186Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:56.8756609Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:56.8757042Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.8757511Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:56.8758058Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:56.8758403Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:56.8758791Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:56.8758929Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:56.8759047Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:56.8759198Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:56.8759362Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:56.8759545Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:56.8759678Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:56.8759946Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:56.8760552Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.8760689Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:56.8760817Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:56.8760937Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:56.8761057Z U 
std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:56.8761252Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.8761527Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:56.8761662Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:56.8761783Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:56.8761929Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:56.8762057Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:56.8762707Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:56.8763215Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:56.8763484Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:56.8763877Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:56.8764448Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:56.8766517Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.8768583Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.8770564Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.8772783Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.8774599Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.8776493Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:56.8778212Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:03:56.8778363Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:56.8778534Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:56.8778688Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:56.8779005Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:56.8779233Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:56.8779347Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:56.8779454Z w _ITM_registerTMCloneTable 2025-05-07T20:03:56.8779554Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:56.8779658Z w __gmon_start__ 2025-05-07T20:03:56.8779756Z w __pthread_key_create 2025-05-07T20:03:56.8779867Z w pthread_mutex_lock@GLIBC_2.2.5 
2025-05-07T20:03:56.8779985Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:56.8780128Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:56.8780377Z + ldd ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:56.8780387Z 2025-05-07T20:03:56.8780539Z linux-vdso.so.1 (0x00007ffe103a4000) 2025-05-07T20:03:56.8780626Z libc10.so => not found 2025-05-07T20:03:56.8780716Z libc10_cuda.so => not found 2025-05-07T20:03:56.8781280Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f10a2e00000) 2025-05-07T20:03:56.8781373Z libtorch.so => not found 2025-05-07T20:03:56.8781466Z libtorch_cpu.so => not found 2025-05-07T20:03:56.8781574Z libtorch_cuda.so => not found 2025-05-07T20:03:56.8781667Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.8781823Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f10a2b9c000) 2025-05-07T20:03:56.8781972Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f10ce6d3000) 2025-05-07T20:03:56.8782132Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f10ce6a5000) 2025-05-07T20:03:56.8782253Z libc.so.6 => /lib64/libc.so.6 (0x00007f10a2994000) 2025-05-07T20:03:56.8782379Z /lib64/ld-linux-x86-64.so.2 (0x00007f10ce72f000) 2025-05-07T20:03:56.8782482Z libc10.so => not found 2025-05-07T20:03:56.8782577Z libc10_cuda.so => not found 2025-05-07T20:03:56.8783033Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f10a279e000) 2025-05-07T20:03:56.8783573Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f10a26a6000) 2025-05-07T20:03:56.8783693Z libtorch.so => not found 2025-05-07T20:03:56.8784032Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so (0x00007f10a2000000) 
2025-05-07T20:03:56.8784465Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f10a1600000) 2025-05-07T20:03:56.8784603Z libtorch_cpu.so => not found 2025-05-07T20:03:56.8784697Z libtorch_cuda.so => not found 2025-05-07T20:03:56.8784815Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.8784952Z libm.so.6 => /lib64/libm.so.6 (0x00007f10a25cb000) 2025-05-07T20:03:56.8785041Z libc10.so => not found 2025-05-07T20:03:56.8785128Z libc10_cuda.so => not found 2025-05-07T20:03:56.8785886Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so (0x00007f10ce694000) 2025-05-07T20:03:56.8786156Z libtorch.so => not found 2025-05-07T20:03:56.8786252Z libtorch_cpu.so => not found 2025-05-07T20:03:56.8786348Z libtorch_cuda.so => not found 2025-05-07T20:03:56.8786560Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.8786649Z libc10.so => not found 2025-05-07T20:03:56.8786739Z libc10_cuda.so => not found 2025-05-07T20:03:56.8786848Z libtorch.so => not found 2025-05-07T20:03:56.8786992Z libtorch_cpu.so => not found 2025-05-07T20:03:56.8787090Z libtorch_cuda.so => not found 2025-05-07T20:03:56.8787186Z libcudart.so.11.0 => not found 2025-05-07T20:03:56.8787289Z libc10.so => not found 2025-05-07T20:03:56.8787658Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so (0x00007f10c2f88000) 2025-05-07T20:03:56.8787750Z libtorch.so => not found 2025-05-07T20:03:56.8787860Z libtorch_cpu.so => not found 2025-05-07T20:03:56.8787956Z libtorch_cuda.so => not found 2025-05-07T20:03:56.8788051Z libtorch.so => not found 2025-05-07T20:03:56.8788143Z libc10.so => not found 2025-05-07T20:03:56.8788247Z libc10_cuda.so => not found 2025-05-07T20:03:56.8788344Z libtorch_cpu.so => not found 2025-05-07T20:03:56.8788440Z libtorch_cuda.so => not found 2025-05-07T20:03:56.8788551Z libcudart.so.11.0 => not found 
2025-05-07T20:03:56.8788646Z libtorch.so => not found 2025-05-07T20:03:56.8788738Z libc10.so => not found 2025-05-07T20:03:56.8788831Z libtorch_cpu.so => not found 2025-05-07T20:03:56.8788946Z libtorch_cuda.so => not found 2025-05-07T20:03:56.8789128Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f10ce685000) 2025-05-07T20:03:56.8789226Z libtorch_cpu.so => not found 2025-05-07T20:03:56.8789336Z libtorch_cuda.so => not found 2025-05-07T20:03:56.8789428Z libtorch.so => not found 2025-05-07T20:03:56.8789570Z librt.so.1 => /lib64/librt.so.1 (0x00007f10ce67e000) 2025-05-07T20:03:56.8789575Z 2025-05-07T20:03:56.8789703Z [CHECK] Displaying ELF information: 2025-05-07T20:03:56.8790003Z + readelf -d ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:56.8790008Z 2025-05-07T20:03:56.8799683Z 2025-05-07T20:03:56.8800296Z Dynamic section at offset 0xaed9e48 contains 39 entries: 2025-05-07T20:03:56.8800652Z Tag Type Name/Value 2025-05-07T20:03:56.8801269Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:56.8801862Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:56.8802658Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:56.8803279Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:56.8803491Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:56.8803712Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:56.8803930Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:56.8804140Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:56.8804346Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:56.8804673Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:56.8804871Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 
2025-05-07T20:03:56.8805090Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:56.8805471Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:03:56.8805662Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:56.8805787Z 0x000000000000000c (INIT) 0x1ad000 2025-05-07T20:03:56.8805917Z 0x000000000000000d (FINI) 0xe4d99c 2025-05-07T20:03:56.8806044Z 0x0000000000000019 (INIT_ARRAY) 0xaed55e8 2025-05-07T20:03:56.8806216Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:03:56.8806357Z 0x000000000000001a (FINI_ARRAY) 0xaed5890 2025-05-07T20:03:56.8806482Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:56.8806601Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:56.8806759Z 0x0000000000000005 (STRTAB) 0x1b3a0 2025-05-07T20:03:56.8806872Z 0x0000000000000006 (SYMTAB) 0x5920 2025-05-07T20:03:56.8807022Z 0x000000000000000a (STRSZ) 1481806 (bytes) 2025-05-07T20:03:56.8807144Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:56.8807283Z 0x0000000000000003 (PLTGOT) 0xaedb0f8 2025-05-07T20:03:56.8807421Z 0x0000000000000002 (PLTRELSZ) 22176 (bytes) 2025-05-07T20:03:56.8807536Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:56.8807669Z 0x0000000000000017 (JMPREL) 0x1a6bf0 2025-05-07T20:03:56.8807783Z 0x0000000000000007 (RELA) 0x186df0 2025-05-07T20:03:56.8807923Z 0x0000000000000008 (RELASZ) 130560 (bytes) 2025-05-07T20:03:56.8808050Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:56.8808171Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:56.8808299Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:56.8808425Z 0x000000006ffffffe (VERNEED) 0x186cd0 2025-05-07T20:03:56.8808553Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:56.8808678Z 0x000000006ffffff0 (VERSYM) 0x184fee 2025-05-07T20:03:56.8808790Z 0x000000006ffffff9 (RELACOUNT) 811 2025-05-07T20:03:56.8808908Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:56.8808913Z 
2025-05-07T20:03:56.8809032Z ################################################################################ 2025-05-07T20:03:56.8809037Z 2025-05-07T20:03:56.8809041Z 2025-05-07T20:03:56.8809250Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:03:56.8916708Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:56.8943951Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:56.8993536Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:56.9030927Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:56.9261283Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:56.9303539Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:56.9341994Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:56.9371718Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:56.9483258Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:56.9510504Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:56.9560524Z [CHECK] Symbol NOT found in 
./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:56.9591991Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:56.9827929Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:56.9866257Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:56.9898248Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:56.9925046Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:57.0333138Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:57.0691864Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:57.1601193Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:57.1822617Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:57.1909805Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:57.1942509Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:57.2268207Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.11/cmake-build/fbgemm_gpu_py.so: 
fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:57.2270090Z ################################################################################ 2025-05-07T20:03:57.2271683Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:03:57.2272114Z 2025-05-07T20:03:57.2272718Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:03:57.2273299Z 2025-05-07T20:04:05.2836379Z 2025-05-07T20:04:05.2836902Z fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl is 2025-05-07T20:04:05.2837474Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:04:05.2837805Z 2025-05-07T20:04:05.2837993Z The wheel references external versioned symbols in these 2025-05-07T20:04:05.2838447Z system-provided shared libraries: librt.so.1 with versions 2025-05-07T20:04:05.2838900Z {'GLIBC_2.2.5'}, libgcc_s.so.1 with versions {'GCC_3.0', 2025-05-07T20:04:05.2839331Z 'GCC_12.0.0'}, libstdc++.so.6 with versions {'GLIBCXX_3.4.14', 2025-05-07T20:04:05.2839798Z 'GLIBCXX_3.4.20', 'CXXABI_1.3.7', 'GLIBCXX_3.4.21', 'GLIBCXX_3.4.19', 2025-05-07T20:04:05.2840270Z 'CXXABI_1.3.11', 'GLIBCXX_3.4', 'GLIBCXX_3.4.15', 'CXXABI_1.3.5', 2025-05-07T20:04:05.2840726Z 'CXXABI_1.3.3', 'GLIBCXX_3.4.9', 'GLIBCXX_3.4.18', 'GLIBCXX_3.4.11', 2025-05-07T20:04:05.2841201Z 'CXXABI_1.3'}, libc.so.6 with versions {'GLIBC_2.3.3', 'GLIBC_2.17', 2025-05-07T20:04:05.2841659Z 'GLIBC_2.3', 'GLIBC_2.2.5', 'GLIBC_2.3.2', 'GLIBC_2.6', 'GLIBC_2.14'}, 2025-05-07T20:04:05.2842134Z libpthread.so.0 with versions {'GLIBC_2.2.5', 'GLIBC_2.3.4'}, 2025-05-07T20:04:05.2842807Z libm.so.6 with versions {'GLIBC_2.2.5'}, libcudart.so.11.0 with 2025-05-07T20:04:05.2843276Z versions {'libcudart.so.11.0'}, libgomp.so.1 with versions 2025-05-07T20:04:05.2843702Z {'OMP_1.0'}, libdl.so.2 with versions {'GLIBC_2.2.5'} 2025-05-07T20:04:05.2843944Z 2025-05-07T20:04:05.2844231Z This constrains 
the platform tag to "manylinux_2_35_x86_64". In order 2025-05-07T20:04:05.2844767Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:04:05.2845310Z wheel from source on a system with earlier versions of these 2025-05-07T20:04:05.2845737Z libraries, such as a recent manylinux image. 2025-05-07T20:04:05.3780709Z 2025-05-07T20:04:05.3780727Z 2025-05-07T20:04:05.3781698Z ################################################################################ 2025-05-07T20:04:05.3783144Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:04:05.3784123Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:04:05.3784488Z 2025-05-07T20:04:05.3797466Z -rw-r--r--. 1 root root 262M May 7 20:03 dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:04:05.3798833Z 2025-05-07T20:04:05.3799152Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:04:05.3800506Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:04:05.3801707Z 2025-05-07T20:04:05.8715928Z 9c33dfdca9bf43b7ae1ee715c0be062b09f6ee76 dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:04:05.8717581Z 2025-05-07T20:04:05.8718375Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:04:05.8719525Z 2025-05-07T20:04:07.0155799Z 51bf70f94d6257344247f790f4fe8196a6607cbc2137484c9ff2af5272981f5d dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:04:07.0157825Z 2025-05-07T20:04:07.0158577Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:04:07.0159675Z 2025-05-07T20:04:07.4556605Z 997d075e1ac42497b36ff5120d82cca7 dist/fbgemm_gpu_nightly-2025.5.7-cp311-cp311-manylinux_2_28_x86_64.whl 2025-05-07T20:04:07.4558089Z 2025-05-07T20:04:07.4558492Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:04:07.4671679Z ##[group]Run 
actions/upload-artifact@v4 2025-05-07T20:04:07.4672039Z with: 2025-05-07T20:04:07.4672414Z name: fbgemm_default_x86_clang_py3.11_cu11.8.0.whl 2025-05-07T20:04:07.4672936Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:04:07.4673277Z if-no-files-found: error 2025-05-07T20:04:07.4673593Z compression-level: 6 2025-05-07T20:04:07.4673862Z overwrite: false 2025-05-07T20:04:07.4674152Z include-hidden-files: false 2025-05-07T20:04:07.4674435Z env: 2025-05-07T20:04:07.4674712Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:04:07.4675045Z BUILD_ENV: build_binary 2025-05-07T20:04:07.4675373Z BUILD_TARGET: default 2025-05-07T20:04:07.4675637Z BUILD_VARIANT: cuda 2025-05-07T20:04:07.4675931Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T20:04:07.4676216Z ##[endgroup] 2025-05-07T20:04:07.4680308Z ##[command]/usr/bin/docker exec f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:04:07.8682238Z With the provided path, there will be 1 file uploaded 2025-05-07T20:04:07.8684357Z Artifact name is valid! 2025-05-07T20:04:07.8685126Z Root directory input is valid! 
2025-05-07T20:04:07.9438256Z Beginning upload of artifact content to blob storage 2025-05-07T20:04:08.5780340Z Uploaded bytes 8388608 2025-05-07T20:04:08.8885001Z Uploaded bytes 16777216 2025-05-07T20:04:09.1934053Z Uploaded bytes 25165824 2025-05-07T20:04:09.4688773Z Uploaded bytes 33554432 2025-05-07T20:04:09.7964146Z Uploaded bytes 41943040 2025-05-07T20:04:10.1032639Z Uploaded bytes 50331648 2025-05-07T20:04:10.4363564Z Uploaded bytes 58720256 2025-05-07T20:04:10.6956139Z Uploaded bytes 67108864 2025-05-07T20:04:11.0381581Z Uploaded bytes 75497472 2025-05-07T20:04:11.3552865Z Uploaded bytes 83886080 2025-05-07T20:04:11.8115838Z Uploaded bytes 92274688 2025-05-07T20:04:12.0945828Z Uploaded bytes 100663296 2025-05-07T20:04:12.3169746Z Uploaded bytes 109051904 2025-05-07T20:04:12.6235779Z Uploaded bytes 117440512 2025-05-07T20:04:12.9204222Z Uploaded bytes 125829120 2025-05-07T20:04:13.2165556Z Uploaded bytes 134217728 2025-05-07T20:04:13.4773601Z Uploaded bytes 142606336 2025-05-07T20:04:13.7872032Z Uploaded bytes 150994944 2025-05-07T20:04:14.1108800Z Uploaded bytes 159383552 2025-05-07T20:04:14.4476590Z Uploaded bytes 167772160 2025-05-07T20:04:14.7436383Z Uploaded bytes 176160768 2025-05-07T20:04:15.1224746Z Uploaded bytes 184549376 2025-05-07T20:04:15.4162032Z Uploaded bytes 192937984 2025-05-07T20:04:15.7699976Z Uploaded bytes 201326592 2025-05-07T20:04:16.0830202Z Uploaded bytes 209715200 2025-05-07T20:04:16.3560617Z Uploaded bytes 218103808 2025-05-07T20:04:16.6075929Z Uploaded bytes 226492416 2025-05-07T20:04:16.9775270Z Uploaded bytes 234881024 2025-05-07T20:04:17.3800615Z Uploaded bytes 243269632 2025-05-07T20:04:17.5403956Z Uploaded bytes 251658240 2025-05-07T20:04:17.8877416Z Uploaded bytes 260046848 2025-05-07T20:04:18.1754940Z Uploaded bytes 268042496 2025-05-07T20:04:18.1910433Z Finished uploading artifact content to blob storage! 
2025-05-07T20:04:18.1912696Z SHA256 digest of uploaded artifact zip is 1e9881c86fd1d0d3a51ea88f39661a3922c1e93107ac1ec5d75f39e72a1f317c 2025-05-07T20:04:18.1914413Z Finalizing artifact upload 2025-05-07T20:04:18.3006948Z Artifact fbgemm_default_x86_clang_py3.11_cu11.8.0.whl.zip successfully finalized. Artifact ID 3081409868 2025-05-07T20:04:18.3007973Z Artifact fbgemm_default_x86_clang_py3.11_cu11.8.0.whl has been successfully uploaded! Final size is 268042496 bytes. Artifact ID is 3081409868 2025-05-07T20:04:18.3017736Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081409868 2025-05-07T20:04:18.3271628Z Post job cleanup. 2025-05-07T20:04:18.3277689Z ##[command]/usr/bin/docker exec f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:04:18.6276690Z [command]/usr/bin/git version 2025-05-07T20:04:18.6313968Z git version 2.47.1 2025-05-07T20:04:18.6347608Z Copying '/github/home/.gitconfig' to '/__w/_temp/bad92604-8d76-432d-91c0-f4f5a70f2b16/.gitconfig' 2025-05-07T20:04:18.6355810Z Temporarily overriding HOME='/__w/_temp/bad92604-8d76-432d-91c0-f4f5a70f2b16' before making global git config changes 2025-05-07T20:04:18.6358203Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:04:18.6360271Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:04:18.6398073Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:04:18.6425374Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:04:18.6707785Z Entering 'external/asmjit' 2025-05-07T20:04:18.6755281Z Entering 'external/composable_kernel' 2025-05-07T20:04:18.6813504Z Entering 'external/cpuinfo' 2025-05-07T20:04:18.6860783Z Entering 'external/cutlass' 
2025-05-07T20:04:18.6915472Z Entering 'external/googletest' 2025-05-07T20:04:18.6961989Z Entering 'external/hipify_torch' 2025-05-07T20:04:18.7028028Z Entering 'external/json' 2025-05-07T20:04:18.7087512Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:04:18.7108003Z http.https://github.com/.extraheader 2025-05-07T20:04:18.7114261Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-05-07T20:04:18.7137808Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:04:18.7400989Z Entering 'external/asmjit' 2025-05-07T20:04:18.7451572Z http.https://github.com/.extraheader 2025-05-07T20:04:18.7483725Z Entering 'external/composable_kernel' 2025-05-07T20:04:18.7534250Z http.https://github.com/.extraheader 2025-05-07T20:04:18.7575178Z Entering 'external/cpuinfo' 2025-05-07T20:04:18.7609547Z http.https://github.com/.extraheader 2025-05-07T20:04:18.7644126Z Entering 'external/cutlass' 2025-05-07T20:04:18.7690452Z http.https://github.com/.extraheader 2025-05-07T20:04:18.7735496Z Entering 'external/googletest' 2025-05-07T20:04:18.7768650Z http.https://github.com/.extraheader 2025-05-07T20:04:18.7803452Z Entering 'external/hipify_torch' 2025-05-07T20:04:18.7836550Z http.https://github.com/.extraheader 2025-05-07T20:04:18.7875235Z Entering 'external/json' 2025-05-07T20:04:18.7909960Z http.https://github.com/.extraheader 2025-05-07T20:04:18.8061573Z Stop and remove container: 0f0548cb111f43f4969935384500e226_amazonlinux2023_99f7db 2025-05-07T20:04:18.8066446Z ##[command]/usr/bin/docker rm --force f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81 2025-05-07T20:04:19.6110800Z f3f10d3a0ffb2e1de5d2baa9c8ea87218a6000bbc284942b770d140db8fa8e81 2025-05-07T20:04:19.6143577Z Remove 
container network: github_network_5669e03931344ab5bb72aa11fef66996 2025-05-07T20:04:19.6147976Z ##[command]/usr/bin/docker network rm github_network_5669e03931344ab5bb72aa11fef66996 2025-05-07T20:04:20.5229843Z github_network_5669e03931344ab5bb72aa11fef66996 2025-05-07T20:04:20.5266225Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:04:20.5287045Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:04:20.5293250Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:04:20.5293715Z ##[endgroup] 2025-05-07T20:04:20.5397918Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:04:30.5963267Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:04:46.6127075Z Cleaning up orphan processes